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Increasing  power  density  is  a  daunting  challenge  for  continued  MOSFET 
scaling  due  to  non-scalability  of  the  thermal  voltage  k^Ttq.  To  circumvent  this 
CMOS  power  crisis  and  to  allow  for  aggressive  supply  voltage  reduction, 
alternative  switching  device  designs  have  been  proposed  and  demonstrated  to 
achieve  steeper  than  60mV/dec  subthreshold  swing  ( S ).  This  dissertation  begins 
with  a  general  overview  of  the  physics  and  operation  of  these  MOSFET- 
replacement  devices.  It  then  applies  circuit-level  metrics  to  establish  evaluation 
guidelines  for  assessing  the  promise  of  these  alternative  transistor  designs. 

This  dissertation  then  investigates  the  abrupt  “pull-in”  effect  of  an 
electrostatically  actuated  beam  to  achieve  abrupt  switching  behavior  in  the  nano- 
electro-mechanical  field  effect  transistor  (NEMFET).  To  facilitate  low-voltage 
NEMFET  design,  the  Euler-Bernoulli  beam  equation  is  solved  simultaneously  with 
the  Poisson  equation  in  order  to  accurately  model  the  switching  behavior  of  a 
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NEMFET.  The  impact  of  various  transistor  design  parameters  on  the  gate  pull-in 
voltage  and  release  voltage  are  examined.  A  unified  pull-in/release  voltage  model 
is  developed. 

Finally,  this  dissertation  proposes  the  use  of  micro-relays  for  zero-standby- 
power  digital  logic  applications.  To  mitigate  the  contact  reliability  issue,  it  is 
demonstrated  that  since  relatively  high  on-state  resistance  can  be  tolerated  while 
extremely  high  endurance  is  a  necessity,  hard  contacting  electrode  materials  and 
operation  with  low  contact  force  are  preferred  for  reliable  circuit  operation.  Using 
this  contact  design  approach,  a  reliable  relay  technology  that  employs  titanium 
dioxide  (TiCfi)  coated  tungsten  (W)  electrodes  is  developed  for  digital  logic 
applications.  Relay  miniaturization  will  lead  to  improvements  in  density  (for  lower 
cost  per  function),  switching  delay  (for  higher  performance),  and  power 
consumption.  A  scaled  relay  technology  is  projected  to  provide  >10x  energy  savings 
for  digital  circuits  operating  at  up  to  ~l  00MHz. 
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Chapter  1 


Introduction: 

The  CMOS  Power  Crisis 

1.1  CMOS  Scaling  Trend 

The  steady  reduction  in  the  dimensions  of  complementary  metal-oxide- 
semiconductor  (CMOS)  transistors  from  one  technology  to  another  has  provided 
for  dramatic  improvements  in  the  switching  speed,  density,  cost  and  functionality 
of  CMOS  chips.  As  shown  in  Fig.  1.1,  the  physical  gate  length  of  a  CMOS 
transistor  has  been  reducing  at  an  exponential  rate  and  is  expected  to  be  scaled 
down  to  the  sub-20nm  regime  in  year  2010  [1.1].  However,  due  to  the  fact  that  the 
thermal  voltage  k^T/q  does  not  scale,  the  threshold  voltage  (Ft)  of  CMOS 
transistors  can  no  longer  be  reduced  along  with  their  lithographic  dimensions.  This 
non-scaling  of  the  threshold  voltage  forces  the  supply  voltage  (Fdd)  to  remain 
constant  across  technologies  for  a  given  switching  speed,  as  shown  in  Fig.  1 .2,  Fdd 
has  saturated  at  around  IV  from  the  130nm  technology  forward  [1.1].  Therefore, 
the  power  density  of  integrated  circuits  has  increased  drastically.  As  an  example, 
the  power  dissipation  in  Intel’s  state-of-the-art  microprocessors  has  already  reached 
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the  level  of  100W  or  more  [1.2].  Power  is  now  a  major  constraint  for  modem  day 
CMOS  chip  design. 


Year  of  Production 

Fig.  1.1  Improvements  in  the  switching  speed,  density,  cost  and  functionality  of 
CMOS  chips  have  been  enabled  by  the  steady  miniaturization  of  the  transistor  over 
the  past  four  decades.  By  the  year  2010,  the  physical  gate  length  of  a  CMOS 
transistor  is  expected  to  be  scaled  down  to  the  sub-20nm  regime  [1.1]. 
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Technology  Generation  [pm] 

Fig.  1.2  To  maintain  constant  power  density,  both  the  supply  and  the  threshold 
voltages  of  a  CMOS  integrated  circuit  should  be  reduced  along  with  the 
lithographic  dimension  of  CMOS  transistors.  But  due  to  MOSFET  subthreshold 
leakage,  both  Vm  and  FT  scaling  have  slowed  down  in  recent  technology 
generations  [1.1]. 

1.2  MOSFET  Physics  in  the  Subthreshold  Regime 

The  CMOS  power  crisis  is  fundamentally  due  to  the  non-scaling  of  the 
thennal  voltage  k^Tlq,  which  sets  a  lower  limit  for  the  subthreshold  swing  ( S)  of  a 
MOSFET.  Fig.  1.3  shows  the  /ds-FgS  characteristics  of  a  MOSFET.  Below  the 
threshold  voltage  (Ft),  the  MOSFET  does  not  turn  off  completely;  instead,  7ds 
decreases  exponentially  with  Vgs  with  an  inverse  slope  (“subthreshold  swing”)  S  > 
60mV/dec  at  room  temperature.  Thus,  in  the  off  state  (Fgs=0V),  CMOS  transistors 
still  dissipate  leakage  energy. 
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0  0.2  0.4  0.6  0.8  1 

Gate  Voltage  Vg 

Fig.  1.3  The  subthreshold  swing  S  of  a  MOSFET  is  limited  by  the  thennal  voltage 
kBT/q;  it  is  greater  than  or  equal  to  60mV/dec  at  room  temperature. 


Fig.  1.4  The  potential  barrier  for  thennionic  emission  can  be  modulated  (A(j)s)  by 
the  gate  voltage;  and  according  to  Boltzmann  statistics  [1.3],  electron  concentration 
is  exponentially  proportional  to  q(j)s/A:BT.  This  sets  a  lower  limit  for  the  MOSFET 
subthreshold  swing,  which  is  at  least  60mV/dec  at  room  temperature. 
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The  origin  of  S  >  60mV/dec  can  be  explained  by  the  electron  energy  band 


profile  of  a  MOSFET,  which  is  shown  in  Fig.  1.4.  As  Fgs  increases,  the  channel 
potential  is  modulated  by  the  capacitive  divider: 

_  Cox 


for  F5S  <  VT 


(1.1) 


&Vgs  Cox+C  dep 

where  Cox  and  Cdep  are  the  oxide  and  depletion  capacitances,  respectively. 
Increasing  the  channel  potential  (<j)s)  reduces  the  potential  barrier  for  electron 
injection,  and  hence  the  electron  energy  ( E)  increases.  According  to  the  Boltzmann 
statistics  [1.3],  electron  concentration  n(E)  increases  exponentially  with  electron 
energy;  therefore,  the  drain-to-source  current  flow  (7ds)  depends  exponentially  on 
channel  potential: 

Ids  «  n(E)  oc  exp  (£)  oc  exp  (0)  oc  exp  (1.2) 


And  the  subthreshold  swing  of  the  MOSFETs  can  therefore  be  expressed  as: 

£  —  |  d  t°@io  _  d  log  10  Ids  ^  d(ps  ln(\ff)  ^  _|_  C dep\  / j 

\  dVgs  )  d<ps  dVgS  q  V  Cox  ' 

which  is  at  least  60mV/dec  at  room  temperature.  Due  to  the  non-zero  depletion 
capacitance  in  the  transistor,  S  is  typically  ~  1 OOmV/dec  for  state-of-the-art 
MOSFETs. 


The  subthreshold  swing  determines  the  lower  energy  limit  for  CMOS 
electronics.  Fig.  1.5  illustrates  the  dependence  of  MOSFET  energy  consumption  on 
the  power  supply  voltage  for  a  given  switching  speed.  As  the  supply  voltage  is 
scaled  down,  the  dynamic  energy  (~CV dd")  reduces  quadratically;  but  to  maintain 
the  same  switching  speed  (oc  ( Kid-  Ft)),  Ft  must  be  decreased  together  with  Fid  to 
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maintain  the  same  on-state  current.  This,  as  a  result,  exponentially  increases  off- 
state  leakage  (Eqn.  1.2)  and  static  energy.  To  reach  the  minimum  operation  energy, 
the  dynamic  and  the  leakage  energies  must  be  properly  balanced;  and  for  most 
digital  designs,  this  optimal  ratio  is  roughly  30-50%  [1.4].  As  previously  alluded  to, 
both  the  Vdd  and  Ft  have  remained  roughly  unchanged  from  the  130nm  technology 
node  and  onwards;  therefore  the  CMOS  energy  efficiency  has  not  improved 
proportionately  as  the  transistor  dimensions  have  been  scaled  down. 


2.5 


0) 

o  2.0 
_o 
U) 

a>  1.5 

£ 

LU 

T3 

<D 

3S  1.0 

re 
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0.5 


0.1  0.2  0.3 

Vdd[V] 


Fig.  1.5  Dynamic  energy  reduces  quadratically  as  the  supply  voltage  is  scaled 
down;  but  to  maintain  a  certain  switching  speed,  the  threshold  voltage  of  the 
MOSFET  must  be  scaled  down  as  well,  which  increases  the  leakage  energy. 
Therefore  there  exists  an  optimal  Fdd  that  minimizes  the  energy  dissipation. 
Transistor  designs  with  lower  S  value  reduce  the  leakage  energy;  they  therefore 
improve  the  energy  efficiency. 
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1.3  Various  MOSFET  Replacement  Devices 


To  overcome  the  CMOS  energy  efficiency  limit,  alternative  transistor 
designs  which  can  achieve  a  steeper  sub-threshold  swing  ( i.e .  more  abrupt 
transition  between  on-  and  off-states)  have  been  proposed.  As  shown  in  Fig.  1.5, 
transistor  designs  with  lower  S  value  reduce  the  leakage  energy;  this  allows  for 
more  aggressive  Vdd  scaling  and  improvement  in  the  energy  efficiency. 

To  reach  this  goal,  alternative  transistor  designs  such  as  the  tunneling  based 
field  effect  transistors  [1.5,  1.6],  impact  ionization  MOS  [1.7,  1.8],  ferroelectric 
FETs  [1.9,  1.10]  and  electromechanical  devices  [1.11-1.17]  have  been  proposed 
and  demonstrated  to  achieve  sub  threshold  swing  ( S)  <  60mV/dec.  Among  these,  the 
tunneling  field  effect  transistor  (TFET)  and  electromechanical  devices  show  the 
most  promise  for  low  power  electronics  applications. 

1.3.1  Tunneling  Field  Effect  Transistor  (TFET) 

Among  all  the  alternative  transistor  designs,  the  tunnel  field  effect  transistor 
(TFET)  shows  the  most  promise  due  to  its  relative  simplicity  and  resemblance  to 
the  conventional  MOSFET.  The  TFET  utilizes  band-to-band  tunneling  (BTBT) 
current  to  achieve  a  more  abrupt  on-to-off  transition  than  what  is  achievable 
through  thennionic  emission.  Fig.  1.6  shows  the  energy  band  diagram  of  the  TFET 
in  on  and  off  states.  In  the  off  state,  the  wide  energy  barrier  prohibits  quantum 
tunneling  between  the  source  and  channel  regions.  When  a  large  Vgs  is  applied,  the 
energy  barrier  narrows,  and  allowed  energy  states  in  the  channel  conduction  band 


7 


align  with  allowed  energy  states  in  the  source  valence  band,  so  that  electrons  can 
tunnel  from  the  source  to  the  channel.  Since  the  TFET  utilizes  a  different  source 
injection  mechanism  from  the  MOSFET,  it  can  potentially  achieve  lower  S  values, 
which  has  already  been  experimentally  demonstrated  [1.5].  Note,  however,  that  a 
TFET  achieves  5<60mV/dec  only  at  low  current  levels  and  that  S  increases  as  7ds 
increases.  Consequently,  at  high  Fdd  (~1V)  values,  a  silicon  TFET  has  a 
significantly  lower  on-state  current  7on  (~1  pA/pm  at  IV)  than  a  silicon  MOSFET 
(lmA/pm  at  IV).  This  remains  a  principal  challenge  for  TFET  designers. 


OFF  State  ON  State 

Fig.  1.6  Schematic  diagram  of  a  tunneling  field  effect  transistor  and  its  energy  band 
diagram  in  the  off  and  on  states  (n-channel  operation). 

1.3.2  Electromechanical  Devices 

Besides  band-to-band  tunneling,  the  abrupt  “pull-in”  effect  in  micro¬ 
electromechanical  systems  (MEMS)  has  also  been  harnessed  to  realize  new 
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switching  device  designs  with  higher  I0JI0ff  ratio  for  a  given  gate  voltage  swing. 
These  devices  utilize  a  movable  beam  for  switching,  and  they  can  roughly  be 
divided  into  two  categories:  the  nano-electro-mechanical  field  effect  transistor 
(NEMFET)  and  the  micro-electro-mechanical  relay  (micro-relay). 
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Fig.  1.7  Schematic  diagram  of  a  nano-electro-mechanical  n-channel  enhancement 
mode  field  effect  transistor  in  off  and  on  states. 

a.  Nano-Electro-Mechanical  Field  Effect  Transistor  (NEMFET) 

A  nano-electro-mechanical  field  effect  transistor  [1.11-1.13]  is  essentially  a 
MOSFET  with  a  movable  gate  electrode  which  can  be  physically  separated  from 
the  gate  dielectric  layer  by  an  air  gap  (or  vacuum  gap).  As  shown  in  Fig.  1.7,  the 
gate,  which  is  a  mechanical  beam  anchored  on  both  sides  of  the  semiconducting 
channel,  can  be  modelled  as  a  simple  linear  spring  (with  a  characteristic  spring 
constant  k)  suspended  over  the  semiconductor  channel.  The  gate  and  the  channel 
form  a  parallel-plate  capacitor  with  an  equivalent  air-gap. 
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In  the  off-state  (Fgs=0V),  the  gate  is  separated  from  the  gate  dielectric;  the 
gate  coupling  to  the  channel  is  weak  and  the  transistor  is  therefore  turned  off.  When 
a  positive  Vgs  is  applied,  the  electrostatic  force  attracts  the  mechanical  gate  towards 
the  gate  dielectric.  While  the  electrostatic  force  increases  quadratically  with 
increasing  displacement,  the  spring  restoring  force,  which  counteracts  the 
electrostatic  force,  increases  only  linearly  with  displacement.  Hence,  there  is  a 
critical  pull-in  voltage  ( Vpi)  beyond  which  the  electrostatic  force  is  always  larger 
than  spring  restoring  force,  causing  the  gap  to  close  abruptly.  When  the  gate  is  in 
contact  with  the  gate  dielectric,  the  gate  coupling  to  the  channel  is  maximized  and 
the  transistor  is  turned  on.  Taking  advantage  of  this  pull-in  phenomenon,  a 
NEMFET  with  perfectly  abrupt  switching  transition  (S^OmV/dec)  at  Vgs=Vpi  have 
been  utilized  for  logic,  memory  and  resonator  applications  [1.11-1.13]. 

b.  Micro-electro-Mechanical  Relay  (Micro-Relay) 

The  abrupt  pull-in  effect  has  also  been  harnessed  for  micro-electro- 
mechanical  relays  (“micro-relays”)  [1.14-1.17].  The  attractiveness  of  micro-relays 
stems  from  the  fact  that  a  mechanical  switch  offers  nearly  ideal  switching 
characteristics:  zero  off-state  drain-to-source  and  gate  leakage  currents,  and  perfectly 
abrupt  off-to-on  transition.  Since  there  is  no  trade-off  between  off-state  leakage 
current  and  on-state  drive  current,  the  relay  threshold  voltage  and  therefore  Vdd  can 
in  principal  be  reduced  much  more  aggressively  than  for  MOSFETs,  potentially 
leading  to  improved  energy  efficiency. 
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In  terms  of  device  structure  and  operation  (shown  in  Fig.  1.8),  a  micro-relay 
for  digital  logic  applications  (“logic  relays”)  is  very  similar  to  one  targeted  for 
radio-frequency  signal  DC  switching  applications  (“RF  relays”).  In  the  off  state,  an 
air  gap  separates  the  source  from  the  metallic  drain  electrode  so  that  no  current  can 
flow.  In  the  on  state  where  the  gate-to-source  voltage  is  greater  than  the  pull-in 
voltage  (Fpi),  the  source,  which  is  a  movable  beam,  comes  down  and  touches  the 
drain  electrode,  providing  a  conductive  path  for  current  to  flow.  Since  the  relay 
switches  on  abruptly  as  Vgs  is  increased  above  Fpi,  the  h-Vg  characteristic  of  the 
relay  exhibits  an  extremely  steep  (nearly  infinite)  subthreshold  slope. 


Air  gap  1  1 

1  1  □  1 

Source  Gate  Drain 
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Source  Gate  Drain 
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Fig.  1.8  Schematic  diagram  of  a  micro-relay  in  off  and  on  states 

1.4  Objectives 

This  research  focuses  on  the  analysis,  design  and  applications  of  MOSFET- 
replacement  devices,  with  emphasis  on  the  TFET  and  electromechanical  devices. 
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As  alluded  to  previously,  TFETs  often  have  small  S  value  at  low  current  levels  but 
fail  to  achieve  the  required  on-to-off  current  ratio  across  a  range  of  Vdd-  To 
investigate  whether  TFETs  can  effectively  replace  MOSFETs,  one  needs  to 
compare  the  energy-performance  tradeoff  of  a  TFET  with  that  of  a  MOSFET.  To 
achieve  this  goal,  chapter  2  first  reviews  the  energy-performance  tradeoffs  of 
CMOS,  and  shows  that  the  optimal  /0n//0ff  value  for  most  CMOS  replacement 
devices  at  the  optimum  energy  depends  only  on  the  average  activity  factor  and  the 
logic  depth,  and  that  its  value  is  roughly  insensitive  to  all  other  device  parameters. 
Thus,  it  is  the  device’s  effective  sub  threshold  swing  (5efr)  over  a  range  of  voltage 
rather  than  the  steepest  local  subthreshold  swing  ( S)  value  that  determines  the 
energy  efficiency.  With  this  in  mind,  simple  guidelines  for  assessing  the  energy 
efficiency  of  CMOS  replacement  are  established.  As  a  concrete  example,  we  use 
this  methodology  to  compare  TFETs  against  CMOS,  showing  that  TFETs  may 
offer  substantial  (~5x)  energy  savings  for  perfonnance  up  to  the  100MHz  range. 

To  alleviate  the  issue  of  S  degradation  at  high  current  level,  the  use  of  the 
abrupt  gate  pull-in  effect  in  NEMFET  to  achieve  the  required  Ion/I0ff  ratio  with  a 
smaller  Vdd  appears  to  be  an  attractive  solution.  To  facilitate  this  goal,  NEMFET 
device  physics  and  operation  are  studied  in  Chapter  3.  Due  to  the  beam  bending  of 
the  mechanical  gate,  the  channel  potential  is  non-uniform  in  the  semiconductor. 
The  Euler-Bernoulli  beam  equation  is  solved  simultaneously  with  the  Poisson 
equation  in  order  to  accurately  model  the  switching  behavior  of  NEMFETs.  Using 
this  approach,  the  shape  of  the  movable  gate  electrode  and  semiconductor  potential 
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across  the  width  of  the  channel  are  derived  for  the  various  regimes  of  transistor 
operation  (before  gate  pull-in,  after  gate  pull-in,  and  at  the  point  of  gate  release). 
The  impact  of  various  transistor  design  parameters  such  as  the  body  doping 
concentration,  gate  work  function,  gate  stiffness,  and  as-fabricated  actuation  gap 
thickness,  as  well  as  source-to-body  bias  voltage  and  surface  forces,  on  the  gate 
pull-in  voltage  and  gate  release  voltage  are  examined.  A  unified  pull-in/release 
voltage  model  is  developed,  to  facilitate  NEMFET  design  for  digital  and  analog 
circuit  applications. 

Although  the  pull-in  effect  can  be  harnessed  to  achieve  a  perfectly  abrupt 
off-to-on  switching  transition  for  NEMFET;  the  presence  of  the  air-gap  in  the  off- 
state  also  severely  decreases  the  gate-to -channel  capacitive  coupling  in  the  off- 
state,  limiting  NEMFET  scalability.  In  light  of  this  limitation,  chapter  4  discusses 
the  use  of  micro-relays  for  zero-standby  power  logic  applications.  Contact  design 
techniques  to  achieve  reliable  (high-endurance)  micro-relay  operation  are 
described.  Utilizing  TiCF-coatcd  tungsten  contacting  electrodes,  prototype  relays 
fabricated  using  a  CMOS-compatible  process  are  demonstrated  to  operate  with  low 
surface  adhesion  force,  adequately  low  on-state  resistance  (<  I  OOkQ)  over  a  wide 
temperature  range  (20°C-200°C),  and  >109  on/off  switching  cycles  in  N2  ambient 
without  stiction-  or  welding  induced  failure.  These  results  pave  the  pathway  to 
realizing  reliable  micro-relays  for  digital  logic  applications. 

Using  calibrated  relay  delay  and  energy  models,  a  sensitivity-based  relay 
energy-delay  optimization  methodology  is  developed  in  Chapter  5,  in  which  simple 


13 


relay  design  guidelines  are  established.  The  proposed  scaling  methodology  for 
micro-relays  is  then  presented,  which  leads  to  systematic  improvements  in  device 
density,  performance,  and  energy  consumption.  Simulation  results  indicate  that 
scaled  relay  technology  can  potentially  offer  >10x  improvement  in  energy  efficiency 
for  applications  requiring  performance  up  to  -lOOMHz. 

Chapter  6  summarizes  the  key  results  and  contributions  of  this  dissertation; 
future  research  directions  are  also  suggested. 
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Chapter  2 


Circuit-Driven  Requirements  for 
MOSFET-Replacement  Devices 

2.1  Introduction 

As  alluded  to  in  Chapter  1,  many  alternative  switching  devices  [2.1-2.14] 
have  been  proposed  and  demonstrated  to  achieve  subthreshold  swing  ( S ) 
<60mV/dec  to  allow  for  power  supply  ( Fid)  and  threshold  voltage  (Ft)  scaling  to 
alleviate  the  CMOS  power  crisis.  However,  many  of  these  devices  (e.g.  the  TFET) 
achieve  S<60mV/dcc  only  at  low  on-current  levels,  and  fail  to  maintain  improved 
I oJ  I  off  across  a  range  of  Fdd-  In  addition,  some  of  these  devices  (e.g.,  the  NEMFET 
and  NEM  relay)  do  not  begin  to  conduct  current  until  sometime  after  the  control 
voltage  arrives,  leading  to  an  additional  delay.  To  investigate  whether  these  devices 
can  effectively  replace  MOSFETs,  one  needs  to  compare  the  energy-perfonnance 
tradeoff  of  these  new  logic  devices  with  that  for  MOSFETs.  To  achieve  this  goal, 
this  chapter  starts  by  describing  the  energy-performance  tradeoffs  of  CMOS  gates 
in  section  2.2.  It  is  then  shown  in  section  2.3  that  for  a  given  performance  target, 
the  optimal  /on//0rr  value  of  CMOS  at  the  optimum  energy  depends  only  on  the 
average  activity  factor  and  the  logic  depth.  In  section  2.4,  this  optimal  /on//oir  value 
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is  shown  to  remain  roughly  the  same  for  most  CMOS  replacement  devices.  With 
this  optimal  I0JI0n  fixed,  it  is  shown  in  section  2.5  that  the  device’s  effective 
subthreshold  swing  (Ses)  over  a  range  of  voltage  (rather  than  the  steepest  local 
subthreshold  swing  ( S)  value)  determines  the  device’s  energy  efficiency.  With  this 
in  mind,  simple  guidelines  for  assessing  the  energy  efficiency  of  MOSFET 
replacement  devices  are  then  established.  Finally,  as  a  concrete  example,  this 
methodology  is  used  to  compare  TFETs  against  CMOS,  showing  that  TFETs  may 
offer  substantial  (~5x)  energy  savings  for  performance  up  to  the  100MHz  range. 

2.2  Simplified  Energy-Performance  Analysis 

Although  digital  chips  clearly  consist  of  a  broad  variety  of  circuit  types,  the 
tradeoffs  between  energy  and  delay  for  the  majority  of  CMOS  gates  on  a  chip  are 
similar  to  those  of  an  inverter.  Therefore,  at  least  for  devices  whose  qualitative 
behavior  is  similar  to  a  MOSFET,  we  can  approximately  compare  the  energy  and 
delay  tradeoffs  by  using  an  inverter  chain,  as  shown  in  Fig.  2.1.  The  total  energy 
consumption  per  operation  of  an  inverter  chain  with  logic  depth  L(i  ,  average 
activity  factor  a  ,  electrical  fanout  (FO)  f  and  capacitance/stage  C  can  be  computed 
by  adding  the  dynamic  and  the  leakage  energy  components: 

E  —  Edyn  "F  £)ect/c  —  ^-^dYdd^  ‘  f  F  ^df^off^dd^delay  (2.1) 

where  the  /delay  is  simply: 

+  _  LdC'fVdd 

t delay  ~  2i  Ww 
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Fig.  2.1  Zd-stage  inverter  chain  considered  for  energy  efficiency.  For  most  digital 
circuit,  the  energy  delay  tradeoffs  of  CMOS  gates  are  similar  to  the  tradeoff  for  an 
inverter  chain. 

Optimized  circuit  design  entails  the  choice  of  parameters  such  as  supply 
(Vdd)  and  threshold  voltage  (Ft)  that  minimize  the  energy  dissipation  (Eqn.  2.1) 
subject  to  a  given  delay  target  (Eqn.  2.2).  To  reach  this  goal,  the  dynamic  and  static 
energies  must  be  properly  balanced.  For  CMOS  circuits,  dynamic  energy  can  be 
reduced  quadratically  by  decreasing  Fdd.  However,  in  order  to  avoid  increased 
circuit  delay,  Ft  must  be  decreased  along  with  Vdd  to  maintain  a  high  on-state  drive 
current  (Ion)  [2.15],  This  results  in  increased  off-state  leakage  current  (70ff)  and 
hence  increased  static  energy.  For  alternative  devices  that  are  MOSFET-like  [2.1- 
2.9],  [2.12-2.14],  the  tradeoff  between  the  dynamic  and  the  leakage  energies  is 
similar.  Note  that  Nose  and  Sakurai  [2.15]  have  previously  proven  that  for  an 
optimized  CMOS  circuit  design,  the  leakage-to-dynamic-energy-ratio  of  is  roughly 
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0.3-0. 5  across  wide  range  of  parameters.  In  this  dissertation,  we  follow  a  similar 
derivation  approach,  but  now  from  a  device  designer’s  perspective,  to  show  that 
such  an  optimal  energy  ratio  can  equivalently  be  expressed  as  an  optimal  I0JI0n 
ratio  for  CMOS.  To  reach  this  goal,  in  Appendix  I,  the  method  of  Lagrange 
multipliers  is  used  to  show  that  a  generalized  logic  device  is  energy-delay 
optimized  if  the  device  7on  and  7off  values  satisfy  the  following  condition: 

— -^+l+-^^  =  0  (2.3) 

Ldf  I  off  I  off  dl  on 

As  shown  by  (Eqn.  (2.3)),  the  optimum  70n/70ff  depends  only  on  the  circuit  topology 
and  d70ff/d7on,  a  parameter  which  is  related  to  the  subthreshold  swing,  as  will  be 
explained  later. 

Once  the  optimal  70n/70ff  value  is  found,  the  energy  minimum  can  readily  be 
obtained  by  Eqn.  (2.1),  which  can  equivalently  be  expressed  by: 

E  =  VhCLlp(^  +  'g)  (2.4) 

where  Vm  is  the  supply  voltage  required  to  reach  the  target  performance.  Since  the 
optimal  70n/70ff  is  set  by  the  circuit  topology,  we  can  see  from  Eqn.  (2.4)  that  the 
minimum  energy  is  proportional  to  the  dynamic  energy  CV dd“.  This  implies  that,  as 
will  be  explained  more  in  detail  later,  any  logic  device  that  can  achieve  the  required 
70n/70ff  value  at  a  lower  Vdd  than  CMOS  is  going  to  be  more  energy-efficient.  To 
assess  the  promise  of  alternative  switching  devices  for  replacing  the  MOSFET,  one 
needs  to  know  how  70n/70ff  of  these  devices  depend  on  Ltfla.  To  reach  this  goal,  the 
optimal  70n/70ff  for  CMOS  is  first  derived  and  then  shown  to  be  relatively  constant 


across  regions  of  operation  (strong  inversion  versus  sub  threshold). 
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2.3  Optimal  /on/foff  for  CMOS 


To  find  the  optimal  the  I-V  characteristics  of  MOSFETs  are  first 
approximated  as: 


Ids  =  hexV  vgs^Vr  +  anvth  (2.5a) 

Us  =  hea  Vgs>Vr  +  anvth  (2.5b) 


where  n  is  the  subthreshold  slope  factor  (n  =  1.67  for  S=  1 00m V/dec),  a=1.2  and  vth 
is  the  thermal  voltage. 

Using  this  I-V  model,  the  optimal  /on//0rr  can  be  found  by  first 
differentiating  /Gff  with  respect  to  Ion  (Eqn.  (2.3)): 


d,off  _  j0//  dvT  / 2  5) 

dl0n  dl0n 

High  performance  CMOS  digital  circuits  often  operate  in  the  strong  inversion 
region  ( Vo s>Vj),  therefore  dlVd/on  can  be  found  by  differentiating  Eqn.  (2.5b)  with 
respect  to  Ion,  which  gives  the  following  expression: 


1  =  Is  (—Y  a(Vdd  -  VT r-1  (—  ~  —)  (2.7) 

s  \anvth)  ““  T  \dlon  dIonJ 

Substituting  Eqn.  (2.6)  into  Eqn.  (2.7),  we  obtain  the  expression  for  d/0ff/d/on: 

^rL  =  -drJfL(v^(1-a)-v^  (2-8) 

aion  (XUiVth  * on 

Finally,  substituting  Eqn.  (2.8)  into  Eqn.  (2.3),  the  optimal  7on//0 ff  can  be  expressed 
by  the  following  equation: 

<Vdd(a-l)+VT-anvth\ 


Ion  Lclt  V* 

loff  4 a  V 


anvth 


(2.9) 


where  F<id  and  Ft  are  set  by  the  perfonnance  target  (Eqn.  (2.2)): 
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/  LdC-f(anvth)a\  Vdd 

delay  ^  2/se“  )  ( Vdd-VT)a 


(2.10) 


Typically,  the  threshold  and  supply  voltages  lie  within  the  following 
bounds:  1  > K|,i>0.4.  0.5>Et>0.3;  substituting  these  Vdd  and  Ft  values  together  with 
a=1.2  into  Eqn.  (2.9),  the  optimal  Ion/I0 ff  can  be  expressed  by  the  following 
equation: 

(2.11) 

toff  a 

where  K\  lies  within  the  range  1.6  to  3.2.  This  verities  that  for  CMOS  circuits 
operated  in  strong  inversion  region,  /0n//0ff  at  the  energy  optimum  is  mainly  set  by 
the  circuit  topology. 

Thus  far  in  this  discussion,  the  MOSFET  has  been  assumed  to  operate  in  the 
strong  inversion  region  ( Vdd>  Ex);  however,  many  alternative  devices  such  as 
TFETs  achieve  low  on-current  levels  (~1  OpA/pm)  at  supply  voltages  similar  to 
those  used  in  current  high  performance  CMOS  circuits  (~1  V).  These  devices  will 
therefore  only  be  competitive  with  subthreshold  MOSFETs  (operated  with  Vdd<Vj), 
which  dissipate  the  minimum  energy  required  for  CMOS  to  perfonn  a  given 
operation  [2.16,  2.17].  At  the  energy  optimum,  the  optimal  I0JI0n  ratio  for 
subthreshold  CMOS  similarly  can  be  derived  by  first  expressing  off-state  current  in 
tenns  on  the  on-state  current  (Eqn.  (2.5a))  and  the  delay  (Eqn.  (2.2)): 

l°ff  =  IsGXP  t£)  =  !onexP  (~^£)  =  7onexp  (-  nVthLdcf  !<>n)  (2-12) 

Therefore,  d/0ff/d/0n  is: 


dl0ff 

dlon 


(2.13) 
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Substituting  Eqn.  (2.7)  into  Eqn.  (2.3),  the  optimal  /on//0ff  ratio  can  be  expressed  by 
the  following  equation: 

=  -  h*LiambertW  (-  —eA  ~^\n  (^)  «  /f2—  (2.14) 

where  lambertW(y)  is  the  x  that  solves  the  equation  y=xex  and  K2  lies  within  the 
range  1.9  to  3.6.  This  result  is  consistent  with  previous  published  work  [2.16,  2.17]. 
From  Eqn.  (2.14),  it  can  be  seen  that  the  optimal  I0JI0n  depends  only  on  /xZd  la, 
and  also  that  even  in  subthreshold  operation,  the  optimal  I0JI0ff  ratio  stays  roughly 
the  same  as  that  for  super-threshold  MOSFETs  (Eqn.  (2.11)).  This  is  because,  as 
shown  in  Fig.  2.2,  most  of  the  change  in  /on//0ff  (as  a  function  of  VM)  occurs  in  the 
subthreshold  region,  and  therefore  even  in  strong  inversion,  the  ratio  is  relatively 
insensitive  to  small  changes  in  Kid-  As  a  result,  the  optimal  I0n/I0 ff  depends  mainly 
on  fxLd  la  and  is  relatively  insensitive  to  all  other  device  parameters. 
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Fig.  2.2.  Most  of  the  change  in  I0DH0«  (as  a  function  of  Vdd)  for  MOSFET  occurs  in 
the  subthreshold  region  (approximately  five  orders  of  magnitude)  rather  than  in  the 
strong  inversion  region  (approximately  25),  and  therefore  the  ratio  is  relatively 
insensitive  to  small  changes  in  Vdd  ■  Thus  the  optimal  IQJI0n  is  roughly  the  same  for 
both  sub-threshold  and  super-threshold  MOSFETs. 

2.4  Optimal  lonll0 ft  for  CMOS  Replacement  Devices 


Thus  far  the  discussion  has  only  focused  on  optimizing  MOSFETs.  To 
assess  the  promise  of  alternative  switching  devices  for  replacing  the  MOSFET, 
similar  analysis  can  be  applied  to  CMOS  replacement  devices.  To  reach  this  goal, 
as  derived  in  Appendix  II,  Eqn.  (2.3)  is  equivalently  expressed  by  the  following 
equation: 


— -^  +  2  -Zn 

Ldf  I off 


(2.15) 


where  Ses  is  the  effective  subthreshold  swing: 


(2.16) 
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Thus,  the  optimal  IonII0n  of  the  generalized  logic  device  depends  only  on 
fxLd  la  and  dSWf/dFdd,  where  dSefj/dFdd  is  related  to  the  log-concavity  of  the  transfer 
characteristics.  For  a  broad  variety  of  different  logic  devices  that  are  MOSFET- 
like,  the  S  value  degrades  as  the  current  level  increases,  i.e.  their  transfer 
characteristics  are  logarithmically  concave  with  cLSCff/dFdd  ~  0.  Therefore,  even 
without  knowing  the  exact  characteristics  of  a  new  device,  one  can  still 
approximate  the  optimal  /on//0ff  from  Eqn.  (15),  which  gives  the  following 
expression: 

—  K3  x  —  (2.17) 

1  o  ff  a 

where  K3  is  set  by  the  exact  of  d.S’cff/dFdd  value,  and  lies  within  the  range  ~2-8 
for  most  logic  devices,  as  shown  in  Fig.  2.3. 
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Fig.  2.3.  The  optimal  /on//0ff  of  a  generalized  logic  device  depends  on  the  logic  style 
( fxLd/a )  and  the  log-concavity  of  the  device  transfer  characteristics  (dSeff/dFdd). 
dSefi/d  (fid  are  approximately  zero  for  most  MOSFET-like  devices,  therefore 
Fon/Fofr^x  fxLd/a  ,  where  K3~  2-8. 

By  comparing  Kt,  against  K\  and  K-i  (Eqns.  (2.11)  and  Eqn.  (2.14)),  the 
optimal  /on//0ff  ratio  is  roughly  the  same  for  MOSFET-like  devices.  This  is  because, 
as  was  discussed  earlier,  most  of  the  change  in  I0JI0a  (as  a  function  of  Fdd)  occurs 
in  the  region  with  the  largest  the  steepest  effective  subthreshold  slope,  and 
therefore  the  ratio  is  relatively  insensitive  to  small  changes  in  Kid-  Using  the  fact 
that  the  optimal  /„„//(, it  ratio  is  fixed  across  a  wide  range  of  switching  devices, 
simple  guidelines  can  be  derived  to  assess  the  promise  of  MOSFET  replacement 
devices,  which  is  the  focus  of  the  following  section. 
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2.5  Benchmarking  CMOS  Replacement  Devices 
2.5.1  General  Considerations 

As  previously  alluded  to,  for  a  given  perfonnance  target  and  logic  style, 
there  exists  an  optimal  I0JI0s  ratio  to  minimize  the  total  energy,  and  this  value  is 
roughly  the  same  for  most  MOSFET-like  devices.  Therefore  if  a  logic  device  with 
a  small  subthreshold  swing  can  reach  the  required  /on//0 ff  at  a  lower  supply  voltage, 
it  will  achieve  the  same  perfonnance  with  lower  energy  dissipation.  With  this  said, 
merely  focusing  on  the  steepest  local  subthreshold  slope  ( S)  is  misleading,  since 
devices  with  very  small  S  only  at  low  current  levels  may  not  achieve  the  required 
performance.  To  compare  the  true  energy  efficiency,  Fig.  2.4  summarizes  a  simple 
method  to  assess  the  promise  of  alternative  devices  for  replacing  MOSFETs.  For  a 
given  the  circuit  topology,  one  first  detennines  the  optimal  I {J 1 0(i~~2fLJ a .  With  a 
fixed  reasonable  off-state  current  (for  instance  /0n~l  pA/pm)  for  both  devices,  one 
can  then  determine  the  required  on  state  current.  The  supply  voltage  required  for 
each  device  to  reach  such  on  state  current  is  then  graphically  determined.  With  all 
these  parameters  determined,  the  new  device  is  both  faster  and  energy  more 
efficient  than  the  MOSFETs  if  it  can  achieve  the  required  /on//0ff  at  a  lower  Vdd  (he. 
with  a  lower  S'eff).  Notice  that  a  device  with  a  small  S  value  at  low  current  levels  but 
which  require  a  large  Kid  to  reach  the  required  /<„-,//<,  rr  for  the  performance  target, 
like  the  one  shown  in  Fig.  2.4,  does  not  improve  the  overall  energy  efficiency. 
Furthermore,  the  point  at  which  the  7ds-  Vgs  curves  intersect  roughly  corresponds  to 
the  point  where  the  energy-delay  curves  cross  over. 
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Fig.  2.4  (a)  For  a  given  circuit  topology,  the  optimal  70n//0 ff  is  set  (~2 \fxLJa).  To 
compare  any  new  device  against  the  MOSFET,  a  reasonable  off-state  current  is  first 
fixed.  The  new  device  will  be  energy  more  efficient  if  it  can  achieve  the  required 
current  ratio  at  a  lower  Vdd,  i.e.  at  a  lower  effective  subthreshold  slope  (SWf)-  (b) 
The  Vdd  value  at  the  point  where  the  transfer  characteristics  intersect  in  (a)  (denoted 
as  X)  is  roughly  the  same  for  the  intersection  point  in  the  energy-perfonnance 
space  (point  X'). 
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2.5.2  Additional  Considerations 


Some  alternative  devices  (e.g.,  the  NEMFET  and  the  IMOS)  do  not  begin  to 
conduct  current  until  sometime  after  the  control  voltage  arrives,  leading  to  an 
additional  setup  delay  (tsu).  The  delay  time  is  therefore: 

tdelay  =  tRC  +  tsu  =  LdCfVdd  (1  +  YsJ ,  where  ysu=tsu/tRC  (2.18) 

Following  a  similar  procedure  as  shown  in  Appendix  I,  the  optimal  Ion  and  70ff  is 
determined  by  the  following  condition: 

- — - }on_  ^  Jon^^£off_  _  o  (2.19) 

f J //  l off  dl0n 

Comparing  Eqn.  (2.18)  and  Eqn.  (2.19)  with  Eqn.  (2.2)  and  Eqn.  (2.3),  we  can  see 
that  these  devices  not  only  need  to  have  l+ysu  times  higher  on  current,  but  also 
I  +;/su  times  higher  /0n//0ff  (i.e.  I  +ysu  times  smaller  in  Seff  than  the  MOSFET)  to 
compensate  for  the  increased  leakage  energy. 

Furthermore,  for  devices  with  a  large  setup  time,  circuit  topologies  with 
short  logic  depth  but  large  fan-out  per  stage  are  preferred  to  minimize  the  overall 
delay.  For  the  case  of  a  simple  buffer  chain,  the  optimal  fan  out  per  stage  f  m  and 
optimal  logic  depth  Lopt  can  be  estimated  [2.18]  by  the  following  equations: 

fopt  =  exp  (l  +  ,  Lopt  =  In  (CL/Cin)/ln  fopt  (2.20) 

where  Cm  in  the  input  capacitance  of  the  inverter  chain. 

In  addition  to  the  non-zero  setup  time,  many  MOSFET  replacement  devices 
may  have  gate  or  other  parasitic  capacitance  (denoted  as  C’);  the  switching  delay 
for  these  devices  are: 
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tdeiay  =  ^r^Yc,  where  yc=C’/C  (2.21) 

For  these  devices,  the  optimal  7on/7off  remains  unchanged,  but  the  on  current 
can  be  yc  times  lower  for  the  same  perfonnance  target.  However,  it  should  be  noted 
that  for  a  given  switching  energy,  even  if  a  device  has  a  low  gate  capacitance,  it 
may  not  necessarily  allow  for  higher  supply  voltage.  This  is  because  the  device 
layout  area  impacts  capacitance  ( e.g .  of  interconnect  wires)  and  thereby  impacts 
circuit  switching  energy  and  constrains  the  supply  voltage  that  can  be  used. 

In  setting  device  and  circuit  design  parameters  to  optimally  balance  leakage 
and  dynamic  energies,  it  is  critical  to  consider  the  impact  of  variations.  For 
example,  since  70ff  varies  exponentially  with  the  Ft  of  a  MOSFET,  the  average  70ff 
is  much  higher  than  70ff(FTaverage);  thus,  maintaining  the  appropriate  energy  ratio 
requires  a  lower  nominal  70ff.  In  contrast  to  energy,  the  performance  of  a 
synchronous  digital  circuit  is  set  by  the  critical  paths.  While  there  is  some  summing 
of  delay  variations  along  the  path,  the  paths  are  not  very  long,  so  variations  remain. 
Thus  Ion  must  be  increased  to  ensure  all  paths  meet  the  perfonnance  target  for  the 
worst-case  variations. 

Applications  with  low  perfonnance  demands  or  large  amounts  of  parallelism 
can  tolerate  reduced  device  performance  [2.19],  so  that  Fdd  can  be  scaled  more 
aggressively  (with  margin  for  variation)  to  reduce  energy.  It  can  be  shown  that  for 
these  applications,  7on/Fdd  is  not  as  critical  as  the  minimum  supply  voltage  kid.mm, 
which  depends  only  on  maintaining  the  optimal  70;/70j^  ratio  and  is  proportional  to 

Se«- 
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Before  moving  on,  it  should  be  noted  that  even  if  a  device  has  low  Ses  but 
requires  a  non-zero  output  voltage  (Vds)  [2.6,  2.14]  to  conduct,  it  may  not  improve 
the  overall  energy  efficiency.  This  is  because  digital  gates  built  which  such  a 
device  would  either  dissipate  significant  static  power,  or  would  be  significantly 
constrained  in  terms  of  the  number  of  devices  that  can  be  connected  in  series. 


2.6  TFET  Comparisons  with  CMOS:  An  Example 


To  illustrate  how  the  aforementioned  methodology  can  be  used  to  assess  the 
promise  of  a  MOSFET  replacement  device,  we  herein  use  the  TFET  as  an  example 
and  compare  it  against  the  MOSFET.  The  TFET  (Fig.  2.5)  utilizes  band-to-band 
tunneling  (BTBT)  current  to  achieve  a  more  abrupt  on-to-off  transition  than  what  is 
achievable  through  thennionic  emission.  For  the  purposes  of  this  study,  it  is 
adequate  to  simply  approximate  7ds  using  the  band-to-band  tunneling  model  [2.20, 
2.21],  which  predicts: 

lds=AEsexp(-^ f^tyAEsexp(-l)  (2.22) 

where  Es  is  the  electric  field  in  the  region  where  the  tunneling  occurs,  which  can  be 
estimated  for  a  source-tunneling  FET  by  the  following  simple  expression: 


t-t  (y gs+V  tunnel) 

£c  = - 

Ktox 


(2.23) 


where  q  Ftunnei  is  the  minimum  energy-band  bending  needed  for  band-to-band 
tunneling  to  occur,  k  is  the  ratio  of  the  semiconductor  pennittivity  to  the  gate  oxide 
pennittivity  and  tox  is  the  gate-oxide  thickness  over  the  source  [2.22], 
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OFF  State  ON  State 


Fig.  2.5  A  schematic  diagram  and  the  energy  band  diagram  in  the  OFF/ON  state  of 
a  source-tunneling  field  effect  transistor. 

This  band-to-band-tunneling  current  model  is  used  as  it  matches  well  with 
the  measured  data  of  the  germanium-source  TFET  [2.23]  and  the  BTBT  off-state 
leakage  current  in  silicon  MOSFETs  [2.22].  Note  that  a  TFET  has  a  very  small  S 
value  at  low  current  levels,  but  that  S  increases  as  7ds  increases.  Furthermore,  at 
high  Vdd  (/-TV)  values,  a  silicon  TFET  has  a  significantly  lower  on- state  current  7on 
(~  1  p  A/pm  at  IV)  than  a  MOSFET  (lmA/pm  at  IV).  Current  research  efforts  focus 
on  improving  7on,  for  instance,  by  using  a  smaller-bandgap  material  such  as 
gennanium  [2.23,  2.24].  Therefore,  two  representative  TFET  technologies  are 
compared  herein:  a  nonnal  low  7on  TFET  technology  (“low  7on  TFET  technology”) 
and  an  advanced  TFET  technology  that  provides  a  high  7on  (“high  7on  TFET 
technology”)  to  show  the  implications  of  the  energy-performance  analysis.  The 
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device  design  parameters  and  the  I-V  characteristics  of  both  TFET  devices  are 
shown  in  Table  I  and  in  Fig.  2.6. 

1.E-04 
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1.E-06 
1.E-07 

E"  1.E-08 

a 
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Fig.  2.6  Simulated  I-V  characteristic  of  two  representative  TFET  technologies:  the 
“low  /on”  and  “high  /on”  TFET  technologies.  The  Ftunnei  values  for  low  /on  and  high 
7on  TFETs  are  respectively  0.13V  and  0.07V. 

Assuming  F[unnci  is  a  parameter  that  can  be  adjusted  by  utilizing  advanced 
processing  technologies  (e.g.  gate  work  function  engineering),  the  optimal  I0JI0n 
can  readily  be  found  from  Eqn.  (2.22)  and  Eqn.  (2.23): 

ton  dIoff  (Q.i-0.3 -vdd)  f  x  |  b'  \  (224) 

loff  dl0n  V tunnel  '  V tunnel  ' 

where  B  ’=KtoxB.  For  typical  values  of  B  -3V,  Ftunnei  =0.2V  and  Vdd  =0.5V,  I0JI. off 
x  d/oft/d/on  can  be  computed  from  Eqn.  (2.24)  and  its  value  is  roughly  -30;  the 
optimal  IOD/Io ff  is  therefore  roughly  8 xLtfla,  which  matches  the  prediction  of  Eqn. 
2.17. 
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Performance  (GHz) 

Fig.  2.7  Energy-Performance  comparison  of  30-stage  F04  65nm  CMOS  inverter 
chain  with  a  65nm-equivalent  two  different  TFET  technology. 

With  this  in  mind,  Fig.  2.7  shows  the  simulated  energy-performance 
comparison  of  a  65nm  MOSFET  vs.  a  65nm-equivalent  TFET,  for  a  30-stage  fan- 
out-4  inverter  chain  (transition  probability=0.01,  with  optimized  Fdd,  Ft,  and  Funnel 
values).  The  device  parameters  for  the  MOSFET  (Table  2.2)  are  chosen  according 
to  the  ITRS  specifications  for  the  65nm  LSTP  technology  [2.25].  With  these  circuit 
design  parameters  (Zd,=30  a=().()  I  and /= 4)  given,  the  optimal  /on//0ff  ratio  for  both 
CMOS  and  TFET  is  approximately  2.4x1 04.  The  SCit  values  for  the  two  TFETs  to 
reach  this  Ion/I0ff  ratio  at  different  /on/F dd  values  are  plotted  in  Fig.  2.8.  By 
overlapping  the  I-V  characteristics  of  the  TFET  with  that  of  MOSFET  (with  70ff 
fixed  at  0.1pA/|Jm  for  both  devices),  one  can  see  that  for  relatively  slow  (sub-50 
MHz)  applications  where  7on/Fdd  is  not  critical,  both  high-7on  and  low-7on  TFETs 
have  smaller  Se ff  values  and  hence  can  be  more  energy-efficient  than  a  MOSFET. 
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For  moderate  (50-500MHz)  performance  applications,  only  the  high-/on  TFET  can 
achieve  5efr<  lOOmV/dec  ;  and  for  high-performance  applications  (beyond  1GHz) 
requiring  7on  >  1  OOpA/pm,  both  TFET  technologies  have  Ses> lOOmV/dec  and  they 
therefore  would  consume  more  energy  than  a  MOSFET.  At  the  I0JV dd  where  the 
TFET  achieves  the  same  S'efr  value  as  a  MOSFET  (denoted  as  A’  and  B’  in  Fig.  2.8a 
and  A"  and  B"  in  Fig.  2.8b),  both  devices  consume  roughly  the  same  amount  of 
energy  (points  A  and  B  in  Fig.  2.7).  Based  on  this  analysis,  the  high-/on  TFET 
technology  appears  to  be  compelling  for  low  power  applications  up  to  -lOOMHz. 

This  simplified  energy-perfonnance  analysis  thus  far  assumes  logic  devices 
just  drive  other  devices;  in  reality,  however,  extrinsic  wire  capacitance  (Cw)  must 
be  considered  in  the  analysis  too,  especially  if  the  device  has  an  area  overhead.  Fig. 
2.9  shows  the  sensitivity  of  the  energy  consumption  as  a  function  of  the  wiring 
capacitance.  For  slow  (50MHz)  applications,  a  TFET  operates  at  a  lower  Vdd  than  a 
MOSFET  and  hence  its  energy  consumptions  (ocCwFdd  )  is  less  sensitive  to  Cw;  for 
high  performance  applications  (1GHz),  the  TFET  operates  at  a  higher  Vdd  and 
hence  the  energy  consumption  is  more  sensitive  to  Cw. 
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Low  Ion  TFET 

High/on  TFET 

Physical  Gate  Length  (nm) 

45nm 

A  (A/V/pm) 

3.82E-13 

7.  26E-13 

B  (MV/cm) 

7.78 

4.09 

Equivalent  oxide  thickness  (nm) 

1 

K 

4 

C  (fF) 

0.741fF 

Table  2.1.  Summary  of  device  parameters  for  the  65nm  equivalent  TFET 
technology  used  in  this  work. 


Value 

Physical  Gate  Length  (nm) 

45nm 

a 

1.18 

n 

1.667 

Is  (pA/pm) 

8.259 

C  (fF) 

0.741fF 

Table  2.2.  Summary  of  device  parameters  for  the  65nm  MOSFET  technology  used 
in  this  work. 
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Fig.  2.8  (a)  Effective  subthreshold  values  of  TFET  and  MOSFET  as  different 
f on/ Fid  values  for  a  given  circuit  topology.  For  low-performance  applications,  a 
TFET  can  achieve  the  optimal  I0JI0n  ratio  with  a  small  .Scrr  and  therefore  it  is  more 
energy  efficient  than  a  MOSFET  Note  that  the  cross-over  points  (denoted  as  A’  and 
B’  in  the  figure)  roughly  correspond  to  the  cross-over  points  (denoted  as  A  and  B  in 
the  Fig.  2.7)  in  the  energy-perfonnance  space  and  in  transfer  characteristics 
(denoted  as  A”  and  B”  (b)). 
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Cw/Cgate 


Cw/Cgate 


Fig.  2.9  Impact  of  wiring  capacitance  on  energy  dissipation.  For  low  performance 
application,  a  TFET  is  operated  at  a  lower  Vdd  and  wiring  capacitance  is  less  of  an 
impact  to  the  energy  consumption.  For  high  perfonnance  application,  however,  a 
TFET  needs  a  higher  Vdd  to  provide  for  the  high  on-state  current.  Therefore  the 
energy  consumption  is  more  sensitive  to  wiring  capacitance. 
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2.7  Conclusion 


In  this  chapter,  a  simple  evaluation  guideline  is  established  to  assess  the 
promise  of  new  device  technologies.  Based  upon  the  energy-perfonnance  tradeoff 
of  logic  gates,  it  is  shown  that  the  optimal  I0JI0ff  ratio  for  logic  devices  depends 
largely  only  on  the  circuit  topology,  and  that  this  optimal  I0JI0n  stays  roughly 
constant  across  a  wide  range  of  switching  devices.  With  this  optimal  /on//0ff  ratio  in 
mind,  it  is  then  shown  that  the  effective  subthreshold  swing,  rather  than  the  steepest 
local  subthreshold  swing  determines  the  energy  efficiency  of  logic  devices.  As  a 
concrete  example,  this  methodology  is  used  to  compare  TFETs  against  MOSFETs, 
showing  that  TFETs  may  offer  substantial  (~5x)  energy  savings  for  performance  up 
to  the  100MHz  range. 
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Appendix  I.  Energy-Delay  Optimization  of  the 
Generalized  Logic  Device 


To  minimize  energy  consumption  subject  to  a  delay  constraint,  the 
Lagrange  multiplier  method  is  used  to  set  up  the  following  expression: 

L(Vdd>  I  on)  =  E(ydd,Ion)-A(tdelayIon-L-^Vdd)  (A.l) 

Where  E(VddJ0n)  is  the  energy  consumption 
E(Ydd,Ion)  =  aLdVddCf  +  LdfIoffVdd 

tdelay  (A.2) 


And  D(  Lid, /on)  is  the  delay  constraint 


D(Ydd’  Ion)  tdelaylo 


LdfC 


V, 


dd 


(A. 3) 


ldelaylon 

For  a  broad  variety  of  different  alternative  devices,  the  off  state  leakage 
current  depends  only  on Ion  and  Vdd-  Hence  differentiating  L(VddJon)  with  respect  to 


Vdd- 

^ :  2  aLdVddCf  +  Ldf!offtdelay  +  LdfVditdeldy  ^  =  0  (A.4) 


which  can  be  simplified  to: 


2 1  o  f ft  delay  .  2V  dd^delay  d Vff  _ 


4  aVdd  +  + 

aa  C  C  dVdd 


=  -A 


As  the  total  differential  of  70ff  is: 


dioff  =  d-^dVdd  +  °-^lLdlon 
°ff  dVdd  dd  dIon  on 


dln 


dl 


off 


dVdd 


can  therefore  be  expressed  by: 


dioff  dlgff dioff  dIon  dl0ff  /  dl0n  \  dl0ff  dl0n 

dVdd  dVdd  dIon  dVdd  dIon  \dV dd'  dIon  dVdd 


(A. 5) 


(A.  6) 
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Since  tdelay  =^-Vdd,  ^  Therefore, 


1 

o 

fdIoff 

O 

fO 

1 

i  Ion 

dvdd 

^  dlpn 

dl0n  ^ 

Vdd 

Hence  (A. 5)  is 

4 aVdd  +  2/o//tdetoy  +  Ld/Vdd  f^2££  _  ^££)  =  -A 

aa  C  aj  aa  V  dl0n  dlon ) 

On  the  other  hand,  if  I(Vdd,I0n)  is  differentiated  with  respect  to  I0 


dL 

d  Inn 


'■  LdfV dd.t delay  ^  delay  0 

01  on 


LdfVdd 


dl, 


off 


din 


Equating  (A. 8)  and  (A.  10),  we  get 

4^  +  +  LdfVdd  (%L-%Z).  -LdfVdd 


dl 


off 


dln 


which  is  equivalent  to: 

— — — Edd  +  1  +  LdfVddC  dI°ff  =  0 

of f tdelay  2/ of f^delay  d.Ion 

Substituting  the  delay  expression  to  (A.  12),  we  finally  obtain  (2.3): 


4&  Ion  _|_  ^  _|_  Ion  dlpff  _  q 


^df  Ipff 


Ipff  dl0n 


(A.7) 

(A. 8) 

(A. 9) 
(A.  10) 

(A.  11) 

(A.  12) 

(A.  13) 
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Appendix  II.  Derivation  of  Equation  (2.15) 


Equation  (2.3)  can  equivalently  be  expressed  by  (2.15)  by  first  knowing  that: 


l0n  dl0ff  _  dlog10Ioff  _  dlog10lon-Se^V  dd  _  dSej^Vdd 

1 off  dlQn  dlog10Ion  dlog10Ion  dlog^lQ-j^ 

And  for  a  fixed  perfonnance,  (2.2)  can  be  expressed  by: 


l°9io  Ion  ~  log10 


delay 


;)  +  log 


10  vddL 


Therefore,  using  the  chain  rule,  (A.  14)  can  be  simplified: 


ds-e?fvdd  _  ( 

1  dlog10Vdd  ~  1 


d-^1L  +  l)  S;hVdd  In  10 
dvdd  )  ei>  aa 


After  simplification,  (A.  16)  becomes: 

Substituting  (A.  17)  into  (2.3),  we  obtain  (2.15): 

--^-+2-ln(-^-)(l-log10  =  o 

Ldf  loff  VoffJ  V  Voff)  dVddJ 


(A.  14) 


(A.  15) 


(A.  16) 


(A.  17) 


(A.  18) 
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Chapter  3 


Nano-Electro-Mechanical 
Field  Effect  Transistor  Design 

3.1  Introduction 

In  Chapter  2,  it  was  shown  that  the  effective  subthreshold  swing  (rather  than 
the  steepest,  local  subthreshold  swing)  determines  a  device’s  energy  efficiency. 
With  this  consideration  in  mind,  alternative  transistor  designs  which  offer  perfectly 
abrupt  off-to-on  transition  to  provide  for  high  on/off  current  ratio  with  a  smaller 
supply  voltage  (i.e.  small  5efr  value)  are  attractive  for  energy-efficient  electronics. 
One  such  device  is  the  nano-electro-mechanical  field  effect  transistor  (NEMFET) 
[3.1-3.10],  which  utilizes  the  pull-in  and  release  behavior  of  a  mechanical  beam  to 
achieve  a  perfectly  abrupt  switching  transition,  and  an  effective  subthreshold  swing 
S  that  is  less  than  60mV/dec. 

In  addition  to  low-power  digital  logic  applications,  NEMFETs  also  have 
been  proposed  for  analog  circuit  applications  such  as  resonators  and  sensors  [3.8- 
3.10],  The  motion  of  the  mechanical  gate  (or  body)  changes  the  equivalent  gate- 
oxide  thickness  and  hence  the  transistor  current,  so  that  a  mechanical  signal  can  be 
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effectively  converted  into  an  electrical  signal  with  high  transduction  efficiency 
[3-9]. 

For  digital  logic  applications,  the  pull-in  voltage  Vpi  and  the  release  voltage 
Vri  of  a  NEMFET  are  important  performance  parameters  since  they  determine  the 
turn-on  and  turn-off  voltages  of  the  transistor,  respectively  [3. 1-3.7].  Ideally,  pull- 
in  should  occur  in  the  sub-threshold  regime  of  operation,  i.e.  Fpi  should  be  less  than 
Vj  (defined  as  the  gate-to-source  voltage  Vgs  at  which  the  transistor  current 
becomes  linearly  dependent  on  Vgs)  to  achieve  the  highest  on/off  current  ratio  for  a 
given  gate -voltage  swing.  \VTi\  should  be  greater  than  zero  to  ensure  that  the 
transistor  turns  off  properly,  i.e.  that  it  is  in  the  off  state  for  Vgs  =  OV.  On  the  other 
hand,  for  analog  circuit  applications  [3.8-3.10],  Vpi  sets  an  upper  limit  for  the  bias 
voltage  and  should  be  much  higher  than  Vt  to  allow  for  a  large  DC  bias  current. 
Thus,  an  accurate  model  for  the  pull-in/release  voltages,  as  well  as  the  threshold 
voltage,  is  needed  to  guide  NEMFET  design  for  various  applications. 

Previous  modeling  efforts  used  a  simple  lumped  parameter  model  [3. 1-3.3] 
to  study  the  behavior  of  NEMFETs.  While  this  approach  provides  intuition  for 
NEMFET  design,  it  does  not  account  for  two-dimensional  effects,  e.g.  a  non- 
uniform  actuation  gap  thickness  due  to  bending  of  the  gate  electrode.  Furthermore, 
previous  efforts  lacked  a  discussion  of  the  conditions  necessary  for  pull-in/release 
to  occur  in  the  sub-threshold  vs.  inversion  regime  of  FET  operation.  To  address 
these  shortcomings,  in  this  chapter,  the  Euler-Bernoulli  equation  (applicable  to 
mechanical  beams  and  widely  used  for  modeling  micro-electromechanical  systems 
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[3.11-3.14])  is  solved  simultaneously  with  the  Poisson  equation  to  accurately 
model  Vpi  and  Vx\  of  a  mechanically  gated  FET.  Using  this  model,  the  effects  of 
various  device  design  parameters  ( e.g .  body  doping  concentration,  gate  stiffness, 
as-fabricated  actuation  gap  thickness,  and  source-to-body  bias  voltage)  and  surface 
adhesion  force  are  assessed  in  Section  3.3.  A  unified  pull-in/release  voltage  model 
which  accounts  for  these  effects  is  then  provided  in  Section  3.4. 

Although  the  NEMFET  effective  subthreshold  swing  is  reduced  by  utilizing 
the  pull-in  effect,  the  presence  of  an  air-gap  in  the  transistor  drastically  worsens  the 
short  channel  effects;  the  impact  of  such  effects  on  the  NEMFET ’s  scalability  for 
logic  applications  is  discussed  in  Section  3.5. 

3.2  Physics  of  NEMFET  Operation 

Fig.  3.1  illustrates  the  NEMFET  structure,  which  is  essentially  a  metal- 
oxide-semiconductor  field  effect  transistor  (MOSFET)  with  a  movable  gate 
electrode  that  can  be  physically  separated  from  the  gate  dielectric  layer  by  an  air 
gap  (or  vacuum  gap).  As  shown  in  the  cross-sectional  schematic  in  Fig.  3.2a,  the 
suspended  gate  is  a  doubly-clamped  beam  anchored  on  each  side  of  the 
semiconductor  channel. 

Fig.  3.1  also  presents  the  NEMFET  operation.  In  the  off  state,  an  air  gap 
separates  the  gate  from  the  gate  dielectric;  as  Vgs  increases,  the  electrostatic  force 
attracts  the  gate  to  the  gate  dielectric.  In  the  on  state  (Fig.  3. Id)  where  the  gate-to- 
source  voltage  is  greater  than  the  pull-in  voltage  (Upi),  the  gate  is  pulled  down  and 
is  in  contact  with  gate  dielectric,  which  increases  the  gate-to-channel  coupling, 
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Once  the  gate  is  pulled  in,  the  thin  dielectric  thickness  ensures  that  the  electrostatic 
force  is  larger  than  the  spring  restoring  force,  and  therefore  the  NEMFET  exhibits 
hysteretic  switching  (Fig.  3.1  (d))  -  i.e.  the  release  voltage  Vgs=Vr\  value  is  lower 


than  VPi. 


Fig.  3.1.  (a)  Schematic  diagrams  of  a  nano-electro-mechanical  field  effect  transistor 
(NEMFET).  (b,c)  Simple  lumped-parameter  model  for  a  NEMFET.  (d)  the  abrupt 
pull-in/  release  of  the  gate  electrode  provide  for  perfectly  abrupt  on/  off  transitions. 

The  change  in  the  gap  thickness  at  Vgs=Vpi  (or  Vgs=  Vx\  )  can  be  equivalently 
described  as  a  dynamic  reduction  (or  increase)  in  the  threshold  voltage  Ft.  If  Ft  for 
a  MOSFET  is  defined  as  the  value  of  Vgs  for  which  the  channel  is  just  barely 
inverted,  then  the  change  in  Ft  due  to  the  movement  of  the  gate  at  Vgs=  Vpi  and  Vgs= 

Fri  is  approximately  |AF7  |  ~  tgap  ^A£siqNa^b  [3.1],  where  tgap  is  the  as-fabricated 

air-gap  thickness,  Na  is  the  body  doping  concentration,  and  2(j)b  is  the  channel 
potential  at  the  onset  of  strong  inversion  in  the  channel.  Since  the  threshold  voltage 
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changes  abruptly  as  \  Vgb\  is  increased  above  Vp„  (or  decreased  below  Fri)  the  h-Vg 
characteristic  of  the  NEMFET  exhibits  an  extremely  steep  (nearly  infinite) 
subthreshold  slope  [3. 1-3.4]  at  Fpi  and  Vt\.  To  achieve  low  Rvalue,  the  supply 
voltage  must  be  scaled  down.  To  achieve  this  goal,  Vvl  and  Vx\  need  to  be 
minimized.  In  the  pursuit  of  this  goal  and  to  facilitate  NEMFET  design,  this  section 
aims  at  developing  an  accurate  model  for  Fpi  and  Vr\. 

3.2.1  Lumped  Parameter  Model 

A  simple  lumped  parameter  model  previously  has  been  used  to  study  the 
behavior  of  NEMFETs  [3. 1-3.3],  In  this  model,  as  depicted  in  Fig.  3.1b,  the  gate  is 
treated  as  a  simple  linear  spring  (with  a  characteristic  spring  constant  k )  suspended 
over  the  semiconductor  channel.  The  gate  and  the  channel  form  a  parallel-plate 
capacitor  with  an  equivalent  air-gap  that  is  uniform  in  thickness  across  the 
transistor  channel.  The  details  of  this  lumped  parameter  model  are  well  covered  in 
the  literature  [3. 1-3.3];  the  key  results  are  summarized  herein. 
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Fig.  3.2:  (a)  Schematic  diagrams  illustrating  the  physical  parameters  of  the 
NEMFET.  (b-d)  Shapes  of  the  deflected  gate  beam,  corresponding  to  the  different 
regions  of  NEMFET  operation:  (b)  Before  pull-in,  (c)  Vgs  >  Vph  (d)  Vgs  =Vr\. 

Current  flow  in  the  semiconductor  channel  is  controlled  by  the  voltage 
applied  between  the  gate  and  the  source,  Vgs.  Because  the  gate  beam  is  assumed 
not  to  bend,  the  voltage  drops  across  the  air  gap  Vgap,  the  dielectric  Vox  and  the 
semiconductor  surface  potential  (fh  are  assumed  to  be  independent  of  x: 

Vgs  +  Vsb  =  Vfb  +  Vgap  +  Kx  +  0s  =  Vfb  +  *4//  +  4>s  (3-1) 

where  the  flat-band  voltage  Vfb  is  a  function  of  the  channel  dopant  concentration 

kT  fN  \ 

Na:  Vfb  —  0m  —  0S  —  C  —  —  In  where  C  is  a  constant  and  0m  and  &s  are  the 

work  functions  of  the  gate  and  the  semiconductor  channel,  respectively.  Veff  =  Vgap 
+  Vox  is  the  voltage  drop  across  an  equivalent  air  gap.  If  the  gate  beam  is  not  pulled 
in,  the  spring  restoring  force  is  equal  to  the  electrostatic  force: 
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k  (W  +  °x  9) 


e0WLV2ff 


k  *  39.5 


EWh3 


(3.2) 


2 g2  ’  L3 

where  g  —  d  +  tgap  +  t0X/K0X  is  the  equivalent  air-gap  thickness,  d< 0  is  the 
displacement  of  the  mechanical  gate,  tox  and  kox  are  the  thickness  and  dielectric 
constant  of  the  gate  dielectric,  k  is  the  spring  constant  for  a  clamped-clamped  beam, 
h  and  W  and  L  are  the  thickness  width  and  length  of  the  gate  beam,  respectively.  E 
is  the  Young’s  modulus  of  the  gate  beam  material.  Non-ideal  effects  such  as 
dielectric  charges  and  fringing  capacitances  are  assumed  to  be  negligible. 

To  ensure  proper  NEMFET  operation,  the  spring  restoring  force  must  be 
significantly  greater  than  the  surface  adhesion  force,  Fa.  In  the  absence  of  capillary 
forces,  the  adhesive  interactions  are  dominated  by  the  attractive  van  der  Waals 
force  between  non-contacting  surfaces  [3.15]: 

(3.3) 


2T  , 


Fa=-WL,  for  0  <  \d\  <  tgap 

Uq 


where  T  is  the  adhesion  energy  per  unit  area  and  d0  is  the  average  distance  between 
the  two  surfaces. 

Feff  can  be  computed  from  the  amount  of  areal  charge  in  the  channel,  Qs: 


V. 


eff 


-~Qs 


(3.4) 


where  Qs  depends  on  the  channel  potential  <f>s.  To  simplify  the  analysis,  we  can 
approximate  Qs  as  follows  [3.16]: 


—■j2£siqNa4>s  4>s  <  2(pb  +  Vsb ;  in  depletion 

Q s  j  nj  TT~1  Tf  2kZfdt>s-2-<fib~V sb)  ,  ^  «  ,  .  T/  ..  . 

{—g  2£siNakbTe2lcbI  <ps  >  2 cpb  +  Vsb ;  in  inversion 
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where  £Sj  is  the  dielectric  permittivity  of  the  silicon  channel  material,  Na  is  the 
channel  dopant  concentration,  kb  is  the  Boltzmann  constant,  and 
</^=(kh T/q )  1  n (Wa/ n\).  Solving  Eqns.  (3. 1-3.5)  either  numerically  [3. 1-3.2]  or 
analytically  [3.3]  gives  the  position  of  the  gate  and  semiconductor  surface  potential 
for  different  gate  voltage  biases,  including  Vpi  and  Vr\.  The  details  of  the  solutions 
are  discussed  elsewhere  [3. 1-3.3].  We  will  use  this  model  as  a  reference  for 
comparison  against  the  more  accurate  Euler-Poisson  model. 

3.2.2  Euler  -  Bernoulli  Equation  for  the  Mechanical  Gate 
Beam  Shape 

To  accurately  model  the  switching  behavior  of  the  NEMFET,  the  non- 
uniform  actuation  gap  thickness  due  to  bending  of  the  gate  electrode  must  be  taken 
into  account.  The  various  voltage  drops  are  functions  of  the  position  along  the 
beam  length  direction  (x): 

Vgs  +  Vsb  =  Vfb  +  14// (x)  +  0s(x)  (3.6) 

An  applied  gate  voltage  results  in  an  electrostatic  force  on  the  mechanical 
gate.  The  shape  of  the  gate  beam  can  be  found  by  solving  the  Euler-Bernoulli 
equation  [3.17]  and  depends  on  the  state  of  the  beam:  not  pulled  in  ( Vgs<Vpi ),  as 
illustrated  in  Fig.  3.2b;  pulled  in,  as  illustrated  in  Fig.  3.2c;  or  just  at  the  point  of 
release  (Vgs=Vri),  as  illustrated  in  Fig.  3.2d. 
i.  Beam  not  pulled  in 
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Before  pull-in  occurs,  i.e.  when  the  gate  and  the  gate-dielectric  are  not  in  contact, 
the  shape  of  the  mechanical  gate  is  governed  by  the  following  equation: 

El^^~  ^ - T™  (3’7) 

where  /  is  the  moment  of  inertia  of  the  gate  beam.  Note  that  in  Eqn.  (3.7),  non¬ 
ideal  effects  such  as  residual  stress,  vertical  strain  gradient,  beam  stiffening  due  to 
bending,  dielectric  charges  and  fringing  capacitances  are  assumed  to  be  negligible. 

For  a  clamped-clamped  beam,  we  can  take  advantage  of  symmetry  (as 
depicted  in  Fig.  3.2a)  to  establish  the  boundary  conditions  at  x=0  and  x=Lt. 2  as 
tabulated  in  Table  3.1. 


ii.  Beam  pulled  in 

While  the  electrostatic  force  (Feiec)  increases  quadratically  with  increasing 
displacement,  the  spring  restoring  force  (Fspring,  which  counteracts  the  electrostatic 
force)  increases  only  linearly  with  displacement.  Hence,  there  is  a  critical 
displacement  beyond  which  Fe\ec  is  always  larger  than  Faring,  causing  the  gap  to 
close  abruptly.  This  critical  displacement  has  a  corresponding  value  of  |Fgs|  known 
as  the  “pull-in”  voltage  FPi.  Upon  pull-in,  a  portion  of  the  gate  will  zip  into  contact 
with  the  gate  dielectric,  as  illustrated  in  Fig.  3.2c.  Denoting  la  as  the  total  length  of 
the  gate  regions  which  are  not  in  contact  with  the  gate  dielectric,  the  shape  of  the 
gate  electrode  can  be  determined  in  a  piecewise  manner: 


d4g(x)  _  £0WVeff  pQ2 


- W  for  x  —  > 


g(x)=^for  x-t  <L~F 

K.0x  z  z 


iii.  Beam  at  the  point  of  release 
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With  the  beam  pulled  in,  /a  increases  as  Vgs  decreases.  If  Fgs  is  reduced  to  the 
release  voltage  ( i.e .  if  Vgs=Vr\),  then  la=L  and  the  gate  touches  the  gate  dielectric 
only  at  x=LI2.  Thus,  Eqn.  (3.8)  is  reduced  to  Eqn.  (3.7). 

The  governing  equations  and  boundary  conditions  for  the  different  regimes  of 
gate-beam  operation  are  summarized  in  Table  3.1.  To  detennine  the  shape  of  the 
gate,  Eeff(x)  must  be  known.  It  is  related  to  the  semiconductor  surface  potential 
which  can  be  found  by  solving  the  Poisson  equation. 


Region 
of  Op. 

Governing  Equation 

x=0 

x=L/2  or  /a/2 

Before 

Pull-in 

d4g(x)  _  £0WVeff(x)2  2T 
dx4  2  g(x)2  d0 

g(x) 

go, 

g’W 
=  0 

g'(L/2)=0 

g3(L/2)=0 

At  release 
voltage 

g  (L/2)  t0jK0X 
g’(L/2)=0 
g3(L/2)=0 

Zip-in 

( ‘oWVf,f  2r  L  L-i. 

]  dx 4  2  g(x)2  d0  1  2  2 

{  gix)=^Lfor  x-k 

V  Kqx  z  z 

c/ 2)  t ox/f^ox 

g’(/a/2)=0 

g2(/a/2)=0 

Table  3.1:  The  governing  equation  and  the  boundary  conditions  for  the 
mechanical  gate  in  different  regions  of  operation  (gate  not  pulled  in,  gate  pulled  in, 
and  gate  at  point  of  release). 


3.2.3  Poisson  Equation  for  the  Semiconductor  Surface 
Potential 


Eeff(x)  and  (fh(x)  can  be  found  by  solving  the  Poisson  equation  [3.16],  and 
the  result  is  as  follows: 
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VeffOO  ~ 


a(x) 


-J  2(j£siNa 


(0s  0*0  V sb) 


kbT  -  kRTVsh 


0s(x)-^e  *a 


(3.9) 


Note  that  Eqn.  (3.9)  assumes  a  long-channel  MOSFET  with  small  drain-to-source 
voltage  (so  that  the  drain  voltage  does  not  significantly  impact  the  channel 
potential). 

For  a  given  ^s(x),  the  channel  potential  <p(x,y)  can  be  solved  iteratively  using  the 


equation: 


94>(x,y)  _  -^2 qEsiNa 
dy  £si 


kkLe~^(x’y) 

Q 


+  0U,y)-if)  +  |(  kfe*l 


-^(4>{x,y)-v  Sb) 


<p(x,y)  —  ~^f-e  kBTVsb^j 


(3.10) 


with  the  following  boundary  conditions  at  the  channel  surface  and  deep  within  the 
semiconductor: 


0(*.y) \y=0  =  0s (*)  >  0(*»y)ly-*oo  =  0.  (3.11) 

The  static  behavior  of  a  NEMFET  in  each  regime  of  gate-beam  operation  is 
obtained  by  solving  Eqns.  (3.6)-(3.1 1).  Numerical  simulations  utilizing  the  finite 
difference  and  Newton-Raphson  methods  are  implemented  using  Matlab  7;  the 
details  of  the  simulation  methodology  used  to  study  the  behavior  of  a  NEMFET  are 
summarized  in  Fig.  3.3 
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Fig.  3.3  :  Simulation  methodology  used  to  study  the  behavior  of  a  NEMFET. 


Parameter 

Value 

Parameter 

Value 

Young  Modulus,  E 

160GPa 

Dielectric 
constant  kox 

3.9 

Beam  Width  Gate 
Length,  W 

lOOnm 

Dielectric 
thickness,  tox 

Inm 

Beam  Length 

Gate  Width,  L 

800nm 

Body  biasing  Vsb 

OV 

Gate  thickness,  H 

20nm 

Body  Doping,  Na 

lel/cnr3 

Gap  Thickness, 

tgap 

9.7nm 

Surface 
Roughness,  d0 

O.lnm 

Table  3.2:  NEMFET  device  parameters  used  for  the  Euler-Poisson  simulation 
study  in  this  work. 
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3.3  Results  and  Discussion 


3.3.1  Mechanical  Gate  Shape  and  Channel  Potential 

The  static  characteristics  of  a  lOOnm  (channel  length)  NEMFET,  with 
device  parameters  as  shown  in  Table  3.2,  were  simulated  using  the  Euler-Poisson 
model.  Figs.  4a-4e  show  the  equivalent  gap  thickness  g(x)  and  the  channel 
potential  contours  (fix,y)  for  Vgs=0V  and  1.6V  (before  pull-in),  Fp“=l  .8 14V  (just 
below  the  pull-in  voltage),  Fp+j=1.814V(just  above  the  pull-in  voltage),  and 
Fri=0.308V,  for  Vs b=0V  and  r=0  pj/m2.  At  Vgs=0V,  the  mechanical  gate  is  slightly 
deflected  due  to  the  electrostatic  force  induced  by  the  built-in  voltage  (-Fa);  since 
the  gap  thickness  g(x)  is  non-unifonn,  the  channel  potential  also  varies  with  x. 

As  Vgs  increases,  the  channel  eventually  becomes  inverted  (to  be  n-type). 
The  channel  potential  contours  for  Vgs=l.6V  and  Vgs=  1.814V  are  shown  in  Figs. 
4b-4c.  The  contour  line  for  <f(x,y)=2<fc>=0. 8349V  is  shown  in  each  plot  to  delineate 
the  inversion  region.  Since  the  gate  capacitance  varies  with  x,  the  region  of  the 
channel  near  x=L/ 2  reaches  strong  inversion  first,  as  Vgs  is  increased.  The  lateral 
extent  of  the  inversion  region  then  spreads  as  Vgs  is  further  increased,  as  can  be 
seen  from  Fig.  3.4c. 

For  FgS>Fpi  (Fig.  3.4d),  the  gate  is  pulled  in,  with  its  central  portion  in 
contact  with  the  gate  dielectric.  If  Vgs  were  to  be  reduced  back  toward  OV  within 
this  regime  of  gate-beam  operation,  the  length  of  the  contacting  region  would 
decrease;  at  Vgs=V ri,  the  gate  would  only  be  in  contact  with  the  gate  dielectric  at 
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x=LI. 2  (Fig.  3.4e).  Any  further  reduction  in  Vgs  would  cause  the  gate  to  be  released 
from  the  gate  dielectric. 

As  will  be  explained  below,  the  magnitudes  of  Vvi  and  Vr\  and  the  channel 
condition  at  these  gate  bias  voltages  depend  on  various  device  parameters  including 
the  body  doping  concentration,  gate  work  function,  mechanical  gate  properties,  gap 
thickness,  source-to-body  voltage,  and  surface  adhesion  force. 


Fig.  3.4  (a) 
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Fig.  3.4  (b) 


Fig.  3.4  (c) 
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Fig.  3.4  (d) 


x[nm]  0  100  200  300  400  500  600  700  800 


Fig.  3.4  (e) 

Fig.  3.4  :  Simulated  equivalent  gap  thickness,  g(x),  and  semiconductor  potential 
contours  (f>(x,y)  for  different  gate  voltage  biases:  a)  Vgs=0V,  b)  Vgs=l.6V,  c)  Vgs=Vpir 
=1. 8141V,  d)  Vgs=  VPi+=  1.8141V,  e)  Fg,=  F,./=0.308V. 
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3.3.2  Impact  of  body  doping  concentration,  Na 


The  threshold  voltage  (Ft)  of  a  conventional  MOSFET  can  be  tuned  by 
adjusting  the  body  doping  concentration  Na:  in  order  to  increase  Ft,  Na  is 
increased.  In  contrast,  for  a  NEMFET,  Fpi  and  Vx\  are  independent  of  Na  below 
some  critical  concentration  level;  above  this  level,  Fpi  and  Vx\  decrease  with 
increasing  Na  (Fig.  3.5).  To  explain  this  dependence  of  Vv\  and  Vx\  on  Na,  we  plot 
(ps  at  x  =  LI 2  against  Na,  for  Fgs  =  Fpi  and  Fgs  =  Vr\  in  Fig.  3.6a  and  Fig.  3.6b, 
respectively.  The  (/)s=2<j)^  line,  which  corresponds  to  the  onset  of  strong  inversion  in 
the  channel  at  x=LI 2,  is  also  shown  for  reference.  The  (ps  curves  for  Fgs  =  Fp,  and 
Fgs  =  Fri  intersect  the  2(j^  reference  line  at  the  body  doping  concentrations  AVo.pi 
and  AVd,h,  respectively.  For  Mi>ATn.Pi,  the  surface  channel  is  “fully  depleted” 
(since  <ps<2(f>h  for  all  x)  when  Fgs  =  Fpi.  If  Na<  ATD.pi,  4>s  at  x  =  LI 2  increases  by  60 
mV  for  every  1  Ox  increase  in  Na;  on  the  other  hand,  if  Na>NFD,Pi,  <fis  at  x  =  LI 2 
decreases  very  rapidly  (>60mV/decade)  with  increasing  Na.  A  similar  dependence 
on  Na  is  seen  for  (psatx  =  LI 2  with  Fgs  =  Vx\.  In  the  subsequent  analysis,  we  refer  to 
the  case  where  Na<  Afd, pi  (or  M<  Mdji)  as  “inversion  pull-in  (or  inversion  release)” 
and  the  case  where  A''a>Aj  i).Pi  (or  Na>  A^fdji)  as  “sub-threshold  pull-in  (or  sub¬ 
threshold  release)”1. 


1  Strictly  speaking,  there  also  exists  the  case  where  pull-in/release  occurs  when 
the  semiconductor  surface  is  only  moderately  inverted  [3.16],  As  was  well 
discussed  in  [3.16],  analytical  models  for  this  region  of  operation  are  rather 
complicated;  for  simplicity,  our  lumped  parameter  model  does  not  include  the 
“moderate  inversion  pull-in/release”  case. 
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2.2 


2.0 


1.E+13  1.E+14  1.E+15  1.E+16  1.E+17  1.E+18  1.E+19  1.E+20 
Body  Doping  Concentration  Na  [cm-3] 

(a) 


1.E+13  1.E+14  1.E+15  1.E+16  1.E+17  1.E+18  1.E+19  1.E+20 
Body  Doping  Concentration  Na[cnr3] 

(b) 


Fig.  3.5:  Dependence  of  (a)  Vpi  and  (b)  Vr\  on  the  body  doping  concentration  Na. 
The  results  of  the  lumped-parameter  model  (dotted  line)  are  also  shown  for 
comparison. 
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1.E+13  1.E+14  1.E+15  1.E+16  1.E+17  1.E+18  1.E+19  1.E+20 
Body  Doping  Concentration  Na[cnr3] 


(a) 


Body  Doping  Concentration  Na[cnr3] 

(b) 

Fig.  3.6:  Dependence  of  the  semiconductor  surface  potential  at  x=L/2  on  Na,  for  a 
gate-to-source  bias  of  (a)  Vpi  and  (b)  V,  \.  For  reference,  <f>s(L/2)=2<f>b  is  also  plotted. 

a.  Inversion  pull-in/release 
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For  zero  body  bias,  Vg,=V^+Vtff+(ft,.  At  Vgs=VPi,  Veff  can  be  estimated  using 
Eqn.  (3.4).  In  strong  inversion,  the  depletion  capacitance  is  screened  by  the  free 
carriers  in  the  channel;  thus,  the  negative  feedback  stabilization  [3.18]  vanishes  and 
therefore 


Veff  = 


8  k[tgap+tKox 


) 

KOX ) 


27  £0WL 


V-ni 


picc 


(3.12) 


«  =  ;(w+^)  (3-i3) 

where  Vv{cc  is  the  pull-in  voltage  of  a  conventional  clamped-clamped  beam.  The 
total  areal  charge  in  the  semiconductor  at  Vgs  =  Vpi  is  given  by  the  equation 


Qs  =  ~ 


So Veff 


3SoVpicc 


2(W+k“) 
Then  (fts  can  be  found  from  Eqn.  (3.5): 


(3.14) 


0S  =  Min  ( - )  =  k-^\n  (  9£oVT  )  +  ^ln  M  (3.15) 

<7  \8Ksin2 (tgap+j^)  kBTj  Wsinig20kBTj  q  W 


which  implies  dl^N  ^  ~  60 mV /dec,  consistent  with  Fig.  3.6a. 


By  adding  Fa,  Fcn;  and  (ft,  together,  we  obtain  the  following  expression  for  Fpi: 


v-.  =  c-f\  „{^)  +  Vpicc  + 


'pi 


8Ksinig^kBT 

The  expression  for  Vr\  can  be  derived  similarly: 


fc»rln(  9£ov  \  + 

\8Ksinig^kBTj  q  Vn;/ 


(3.16) 


At  Vgs  —  Vri : 


Veff  - 


2ktgap(tox/K  ox)2 


Qs  = 


e0WL 

£o^eff  f  2kt 


gapt-o 


( Xox/ K  ox) 


WL 


(3.17) 


ktgapNa  \ 
^  Q  \WLKslkBTn2J 
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vrl  =  C  -  —In 


( ^a\  _|_  l^ktyapCtox/ K  ox)2 
WN  27f0VKL 


+ 


kbT  jn  f  ktgap  \ 
.  q  \WLKsikBTnJ 


(3.18) 


Eqns.  (3.16)  and  (3.18)  explain  why  both  Kpi  and  Vx\  are  independent  of  Na  for 
inversion  pull-in/release:  any  increase  in  (f>s  due  to  an  increase  in  Na  is  compensated 
by  a  reduction  in  the  flat-band  voltage, 
b.  Sub-threshold  pull-in/release 

For  sufficiently  large  Na,  pull-in/release  occurs  before  the  semiconductor 
surface  becomes  strongly  inverted.  As  shown  in  Fig.  3.5,  both  Vv\  and  Vr\  decrease 
rapidly  with  increasing  Na  in  this  case.  This  is  because  the  depletion  capacitance 
Cdep  increases  with  Na.  To  understand  this  qualitatively,  we  can  use  the  capacitive 
divider  model  as  shown  in  Fig.  3.1c: 

dVeff  _  cdep 
dVgs  Cgap+Cdep 


Thus,  for  a  given  Vgs,  K-ir  increases  with  increasing  Na  so  that  a  smaller 
value  of  FgS  is  needed  to  achieve  a  certain  electrostatic  force  required  for  pull-in  (or 
release),  and  hence  Vpi  (or  Vri)  decreases  with  increasing  Na.  This  is  consistent  with 
previously  published  work  [3.3],  Notice  also  that  for  intennediate  body-doping 
concentrations,  pull-in  occurs  when  the  semiconductor  surface  is  moderately 
inverted;  therefore,  as  depicted  in  Fig.  3.7,  negative  feedback  stabilization  [3.18] 
reduces  the  pull-in  gap  thickness. 

For  very  high  body  doping  concentration,  </>s(x )  approaches  zero.  Eqn.  (3.7) 
then  becomes: 
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gj  d4g(x) 
dx 4 


(3.20) 


E0w{v-vfb)2 
2g(x)2 

I  Eh3g3 

and  Vpi-Vfh=V0X  converges  to  11.7 - -f-  —  2V;  the  pull-in  gap  thickness  at  x=Lf 2 

also  converges  to  0.603g,  as  indicated  in  Figs.  5a  and  7.  This  is  exactly  the  same  as 
the  pull-in  voltage  and  the  pull-in  gap  thickness  of  a  clamped-clamped  beam  with  a 
metallic  actuation  electrode,  which  is  not  surprising  (since  degenerately  doped 
silicon  is  a  conductive  material)  and  is  consistent  with  the  results  obtained  using  the 
lumped  parameter  model  [3.3], 


1.E+13  1.E+14  1.E+15  1.E+16  1.E+17  1.E+18  1.E+19  1.E+20 
Body  Doping  Concentration  Na[cnr3] 


Fig.  3.7:  Dependence  of  the  gap  thickness  at  x=L/ 2  on  Na  for  a  gate-to-source  bias 
equal  to  Vpi. 

i.  Minimum  Na  for  sub-threshold  pull-in 
The  values  of  M  o.pi  and  A'fd.fI  are  of  interest  to  design  the  NEMFET  for 
different  (digital  vs.  analog)  applications.  If  pull-in  occurs  at  the  onset  of  strong 
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inversion,  (ps  at  x  =  LI 2  for  Vgs  =  Vpi  is  2(f)h  and  the  total  areal  charge  in  the 
semiconductor  is 


Qs  =  V2£siqiVFD,pi(20,)  =  J4esiqNFD>pi^fln(^)  (3.21) 

By  equating  Eqn.  (3.12),  (3.14)  and  Eqn.  (3.21),  we  obtain  the  following  equation: 

N FD,pi  j  (NFD  pi\  _  k{tgap  +  Kqx) 

Tit  Ui  )  6nikbTKsiWL  '  '  ' 

k(tnap+—) 

Letting  = - ,  the  solution  to  Eqn.  (3.22)  is 

&  6  riikbTKsiWL  n  v  / 

N FD.pi  —  ni  cc/lcirnbertW(a)  (3.23) 

where  m=lambertW(n)  is  the  Lambert  W  function,  the  solution  to  the  equation 
n  =  mem. 


Following  the  same  procedure,  A^D,ri  can  similarly  be  found: 
NFD,ri  =  ni  y/lcimbertW  (y) 

where  y  — 


(3.24) 


kt 


gap 


2TlikbTKsiWL 


Although  the  exact  values  of  /Vnxpi  and  AFD,ri  can  only  be  obtained 
numerically,  we  can  still  note  that  since  a  and  y  are  always  positive,  /V|-api  and 
AFD.ri  are  monotonically  increasing  functions  of  a  and  y,  respectively.  Thus,  the 
minimum  body  doping  concentration  required  for  sub-threshold  pull-in/release 
operation  is  larger  for  a  stiffer  gate,  a  larger  air-gap,  or  a  smaller  actuation  area. 
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3.3.3  Impact  of  Gate  Work  Function 


In  a  conventional  MOSFET,  the  gate  work  function  &m  can  be  used  to  tune 
the  threshold  voltage  [3.19,  3.20],  Similarly,  the  gate  work  function  can  be  used  to 
tune  the  pull-in  and  release  voltages  of  a  NEMFET,  since  the  built-in  voltage 
between  the  gate  and  the  semiconductor  channel  induces  an  electric  field.  Since 
Vgs  —  Vj-b  +Ve ff  +  <ps  and  V tb=  (Pm-  (Ps,  both  the  pull-in  voltage  and  the  release 
voltage  shift  linearly  with  @m. 

3.3.4  Impact  of  Gate  Stiffness 

The  stiffness  of  the  mechanical  gate  depends  on  the  gate  dimensions 
(W,h,L)  and  the  gate  material  Young’s  modulus  (E).  As  was  discussed  in  [3.21], 

I  Eh3 g3 

these  parameters  can  be  lumped  into  a  single  parameter  Vpicc  —  1 11.7  the 

pull-in  voltage  of  a  conventional  gap-closing  actuator  with  metallic  electrodes 
[3.22].  Fig.  3.8  plots  Vpi  and  Vr\  for  different  values  of  Fpicc.  Increasing  FPjCC 
increases  the  gate  stiffness,  hence  both  Fpi  and  Vr\  increase  with  Fpicc  over  the  entire 
range  of  body  doping  concentration.  Note  that  AVrxPi  and  ATari  also  increase  with 
the  gate  stiffness,  as  expected  from  Eqns.  (3.23)-(3.24).  This  is  because  a  higher 
body  doping  concentration  is  needed  to  increase  the  value  of  Fgs  at  which  the 
surface  becomes  strongly  inverted  to  be  equal  to  Vpi  (or  F,-i). 

2Note  that  Vpicc  depends  on  g0,  the  as-fabricated  equivalent  gap  thickness.  But  g0  not 
only  changes  Vpicc  ,  but  also  the  equivalent  gate-oxide  thickness  of  the  built-in 
transistor,  as  indicated  by  the  Poisson  equation  (Eqn.  (3.9)).  In  this  subsection,  we 
assume  a  constant  gQ  as  we  change  Vpicc.  The  impact  of  gQ  on  Vpi  will  be  discussed 
in  detail  in  the  following  subsection. 
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3.3.5  Impact  of  As-Fabricated  Air-Gap  Thickness,  tgap 

It  is  well  known  that  a  reduction  in  the  gap  thickness  can  improve  the 
transduction  efficiency  of  an  electrostatically  actuated  device  [3.23].  Similarly,  a 
reduction  in  the  gap  thickness  reduces  the  pull-in  voltage  of  a  NEMFET.  Fig.  3.9 
plots  Vpi  and  Vr\  for  various  values  of  effective  as-fabricated  actuation  gap 
thickness,  g0.  A  decrease  in  g0  reduces  Vpi  in  two  ways:  first,  it  reduces  Vpicc  of  the 
doubly-clamped  beam;  second,  for  a  given  applied  gate  voltage,  it  increases  the 
channel  surface  potential  <f>s  and  hence  the  electric  field  across  the  equivalent  air 
gap  (Eqn.  3.9).  Vx\  also  decreases  because  the  spring  restoring  force  decreases 
linearly  with  decreasing  tgap. 
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(b) 

Fig.  3.8:  Impact  of  beam  stiffness  (correlated  to  the  pull-in  voltage  of  a 
conventional  doubly  clamped  beam,  FPiCC)  on  Vv\  and  Vr\.  An  increase  in  beam 
stiffness  increases  the  voltages  required  for  gate  pull-in  and  release. 
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(b) 

Fig.  3.9:  Impact  of  as-fabricated  equivalent  gap  thickness  (tgap  +  t0X/K0X )  on  Vpi 
and  Vri.  A  reduction  in  the  gap  thickness  increases  the  electrostatic  force  on  the  gate 
and  hence  reduces  FPi.  It  also  reduces  the  spring  restoring  force  and  Fpi. 
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3.3.6  Impact  of  Source-to-Body  Bias  Voltage,  Vsb 

Thus  far  in  our  discussion,  we  have  assumed  that  the  source  and  body 
electrodes  are  biased  at  the  same  potential;  however,  this  is  not  necessarily  the  case, 
since  transistors  may  be  connected  in  series.  Since  the  body  terminal  of  an  n- 
channel  FET  is  usually  grounded,  the  source  of  an  n-channel  NEMFET  can  be  at  a 
potential  that  is  higher  than  the  body  potential.  The  source-to-body  voltage  Vsb 
due  to  a  “floating”  source  can  significantly  affect  the  threshold  voltage  of  a 
conventional  transistor.  The  impact  of  Vsb  on  Fpi  and  Vr\  of  a  NEMFET  depends  on 
whether  pull-in/release  occurs  in  the  sub-threshold  or  inversion  regime  of 
operation. 

For  the  inversion  pull-in/release  case,  both  Vpi  and  Vr\  are  independent  of 
Fsb,  as  shown  in  Fig.  3.10.  This  is  because  in  strong  inversion,  although  Vsb  causes 
a  redistribution  of  charge  between  the  inversion  layer  and  depletion  region,  the  total 
charge  in  the  semiconductor  (Qmv  +  Qdep)  remains  unchanged  [3.16]:  Qinv  + 
Qdep  =  Cox(Vgs  ~  Vfb  -  2(pb)  -  Thus,  Veff  (which  determines  the  actuation  force) 
and  therefore  Vpi  and  Vr\  are  all  independent  of  Vsb. 

For  the  sub-threshold  pull-in/release  case,  however,  an  increase  in  Vsh  increases  the 
electric  field  across  the  air  gap;  thus,  the  value  of  Vgs  required  to  achieve  a  given 
surface  potential  and  Ve ff  decreases: 

V^Vfl+A+VepVsb  (3.25) 

Therefore,  Fpi  and  Vr\  decrease  linearly  with  Vsb  : 

Vpi=  Vpi{ Fsb=0)  -  Vsb  ;  Vri=  Vri  hsb=0)  -  Vsb  (3 .26) 
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(a) 


1.E+13  1.E+14  1.E+15  1.E+16  1.E+17  1.E+18  1.E+19  1.E+20 
Body  Doping  Concentration  Na[cnr3] 

(b) 


Fig.  3.10:  Impact  of  source-to-body  voltage,  VSb,  on  Vpi  and  Vr\.  Both  Vpx  and  VT\ 
show  no  dependence  on  Vsb  if  pull-in/release  occurs  in  the  inversion  region  of 
operation.  Vp\  and  Vt\  decrease  with  increasing  Vsb  if  pull-in/release  occurs  in  the 
inversion  region  of  operation. 
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As  can  be  seen  from  Fig.  3.10,  Vfd.pi  and  M-o,ri  also  depend  on  Vsb.,  for  the 
sub-threshold  pull-in/release  case.  At  the  onset  of  inversion,  ^=2^+Fsb.  Following 
the  derivation  in  Eqns.  (3.28)-(3.30),  A'Vn.pi  can  be  determined  from  the  following 
equation: 

/  Wsb\ 

N FD,Pi  =  nta / lambertW  I  ae2kbT  I  (3.27) 

Similarly,  /Vfd.i-i  can  be  derived:  NFD  rl  —  n£  y /lambertW (je2kbT)  (3.28) 

If  Fsb>  0,  the  gate  voltage  required  for  the  semiconductor  channel  surface  to 
reach  inversion  is  increased.  Thus,  the  range  of  body  doping  concentration  for  sub¬ 
threshold  pull-in/release  is  increased,  i.e.  iVpD.pi  and  VFD,ri  each  decrease  with 
increasing  Vs\,. 

3.3.7  Impact  of  Surface  Adhesion  Force 

Thus  far  in  our  discussion,  surface  forces  have  been  ignored;  but  for  the 
small  air-gap  thicknesses  assumed  herein,  the  Van  der  Waals  force  should  be 
considered.  Fig.  3.11  plots  Vpj  and  Vr\  for  various  values  of  interfacial  adhesion 
energy  per  unit  area.  As  expected,  an  increase  in  the  adhesion  energy  decreases  the 
Vpi  and  Vr\  values.  For  the  NEMFET  device  parameters  used  in  this  work,  the 
spring  restoring  force  is  roughly  96nN,  which  is  significantly  greater  than  the 
surface  adhesion  force  (~3.2nN  for  r=2pj/m2  [3.15]  with  do=0.1nm).  Thus,  the 
impact  of  surface  force  on  Vv\  and  V,  \  is  negligible. 
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(b) 

Fig.  3.11:  Impact  of  surface  adhesion  energy,  T,  on  Vpi  and  Vx\.  An  increase  in  the 
surface  adhesion  force  decreases  both  Vpi  and  Vr\ . 
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3.4  Unified  Model  for  Vp,  and  Vr\ 


A  primitive  sub-threshold  pull-in  model  was  derived  in  [3.3],  Here  we  add 
the  impact  of  body  biasing  and  expressions  for  7VFD,pi  and  A'FD.ri,  in  order  to  make 
the  model  complete: 

3.4.1  Pull-in  Voltage  Vpi 


VPi-Vjb+(j)s  +Veff-Vsb 


(3.29) 


where 


0s  =  { 


9  enNnV‘ 


—in  i  '  °  :r  i  +  K 


q  \8Ksin?gZkBTj 


sb 


k  En 


18  q3K3siN^(WL)2 


1+1  + 


6WL{KsiqNa)2g0 


k£n 


Na  —  NpD.pi 


Na  >  NpD.pi 


(3.30) 


^ptcc 

Veff  ~  j  2  ( g0j2q£siNa4>s 
la  V 


Nq  —  NpD 


pi 


0s)  Na 


>  N. 


FD,pi 


(3.31) 


(  SKsb\  k(t Qav+—)  [ 

NFB.pi  =  n.a/lambertW  a  =  Vpl„  = 


3.4.2  Release  Voltage  Vri 


Vrl~  Vft,+  <t>s  +Veff-Vsb 


(3.32) 


where 


0s 


(  ktaapNa  2)  +  Vsb 
\WLKsikBTn?J  SD 

ktgap 


WLKsiqNa 


Na  —  NpDri 
Na  >  NFD  ri 


_  j  2ktgap(tOX/KOX)2 

v*ff  ~  J  T^Tl 


(3.33) 


(3.34) 


78 


/  SKsb 

NFD,ri  —  niY / lambertW  I  ye2kbT 

To  ensure  that  the  effect  of  surfaces  forces  is  insignificant,  the  spring 
restoring  force  must  be  much  greater  than  the  surface  adhesion  force.  Eqns.  (3.29)- 
(3.34)  can  be  used  to  design  the  NEMFET  in  order  to  achieve  the  desired  values  of 
Fpi  and  Vr\  for  a  particular  application. 

3.5  NEMFET  Scalability 

The  introduction  of  a  thin  air-gap  in  a  NEMFET  drastically  improves  the 
effective  sub  threshold  slope.  However,  such  an  air-gap  also  drastically  decreases 
the  gate-to-channel  capacitive  coupling  in  the  off-state,  making  the  device  more 
susceptible  to  short  channel  effects  [3.16]  than  the  conventional  MOSFET.  Results 
from  quasi  2-D  analysis  [3.16]  indicate  that  the  minimum  channel  length  of  a 
MOSFET  for  a  given  short  channel  control  is  correlated  to  the  characteristic  length 
(A,),  which  is  given  by  the  following  equation: 

A  =  l^teotXj  (3.35) 

Aj  £ox 

where  teot  is  the  equivalent  gate  oxide  thickness,  Xj  is  the  source-drain  junction 
depth.  Assuming  the  NEMFET  and  the  MOSFET  have  the  same  source-drain 
junction  depth,  the  NEMFET  channel  length  needs  to  be  at  least  Anemfet/Amosfet 
-y/lOnm  x  3.9/1  nm  ~  6  times  longer  than  a  MOSFET  with  a  lnm  thick  SiO?  gate 
oxide  for  the  same  short  channel  control.  This  would  drastically  increase  the  circuit 
layout  area  and  hinder  compact  logic  implementation. 


bV  = 


kt 


gap 


2nikt,TKsiWL 
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The  NEMFET  limited  scalability  remains  an  issue  for  logic  applications; 
but  the  built-in  transistor  gain  makes  the  device  very  attractive  for  sensor  and 
resonator  applications  where  layout  area  is  not  as  critical.  To  fully  harness  the 
benefits  of  the  NEMFET,  the  designs  of  the  mechanical  gate  and  the  intrinsic 
transistor  must  be  co-optimized.  As  an  example,  consider  a  NEMFET  resonator 
where  the  resonant  motion  of  the  gate  is  sensed  by  the  modulated  drain-to-source 
current.  The  resonant  frequency  can  be  adjusted  by  changing  the  gate  dimensions 
and  the  motional  current  can  be  maximized  by  optimizing  the  transistor 
transconductance.  To  achieve  this  goal,  though,  techniques  are  required  that  are 
beyond  the  scope  of  this  thesis. 

3.6  Summary 

The  Euler-Bernoulli  equation  is  solved  consistently  with  the  Poisson 
equation  to  model  the  behavior  of  a  nano-electro-mechanical  field  effect  transistor. 
Using  this  Euler-Poisson  model,  the  shape  of  the  deflected  gate  electrode  and 
channel  potential  profile  across  the  width  of  the  channel  for  various  gate  voltages 
can  be  studied.  The  dependence  of  the  pull-in  voltage  ( Vpi)  and  gate  release 
voltage  ( Fr|)  on  the  body  doping,  gate  work  function  engineering,  gate  stiffness,  as- 
fabricated  gap  thickness,  the  body  bias  and  surface  force  are  examined.  The  Euler- 
Poisson  model  well  matches  the  results  of  the  unified  model  for  Vpi  and  Vt\,  which 
is  provided  to  aid  NEMFET  design  for  digital  and  analog  applications. 

To  circumvent  the  limited  scalability  issue  of  the  NEMFET,  a  better 
transistor  design  is  needed  to  fully  harness  the  benefits  of  the  abrupt  pull-in  effects 
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without  sacrificing  short  channel  control;  with  this  goal  in  mind,  micro-relays  for 
logic  applications  are  discussed  in  Chapter  4. 
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Chapter  4 


Design  and  Reliability  of  Micro- 
Relays  for  Logic  Applications 

4.1  Introduction 

Chapter  3  presented  the  operation  and  design  of  the  NEMFET,  which 
harnesses  the  pull-in  effect  to  achieve  a  perfectly  abrupt  off-to-on  switching 
transition.  The  presence  of  the  air-gap  in  the  off-state  severely  limits  NEMFET 
scalability,  however.  In  face  of  this  limitation,  micro-electro-mechanical  relays 
(“micro-relays”)  [4. 1-4.5]  appear  to  be  an  attractive  alternative  for  zero-standby 
power  logic  applications.  The  attractiveness  of  micro-relays  stems  from  the  fact 
that  a  mechanical  switch  offers  nearly  ideal  switching  characteristics:  zero  off  state 
drain-to-source  and  gate  leakage  currents,  and  perfectly  abrupt  off-to-on  switching 
transition.  Since  there  is  no  trade-off  between  off-state  leakage  current  and  on-state 
drive  current,  the  relay  threshold  voltage  and  therefore  Vdd  can  in  principle  be 
reduced  much  more  aggressively  than  for  a  MOSFET  in  order  to  improve  the 
energy  efficiency. 

In  terms  of  device  structure  and  operation,  a  micro-relay  for  digital  logic 
(dubbed  “logic  relays”)  applications  is  very  similar  to  that  for  used  for  radio- 
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frequency  signal  DC  switching  (dubbed  “RF  relays”)  applications  [4. 6-4. 9]. 
However,  the  relay  contact  resistance  requirements  for  these  two  applications  are 
drastically  different.  For  RF  relays  in  which  achieving  ultra-low  on-state  resistance 
(Ron  <  IQ)  is  the  primary  target,  the  relay  design  (e.g.  device  dimensions, 
contacting  electrode  material,  contact  force,  etc.)  is  optimized  to  achieve  the 
required  metal-to -metal  contact  conductance.  In  contrast,  for  logic  relays,  Ron  can 
be  as  high  as  10-100kf2  (for  load  capacitance  of  10-lOOfF)  because  the  switching 
delay  of  a  relay-based  circuit  is  dominated  by  the  mechanical  pull-in  time  (foi, 
typically  10-  100ns)  rather  than  the  electrical  RC  delay  (foe)  [4.2].  Since  extremely 
high  endurance,  fast  switching  speed,  high  energy  efficiency  and  high  layout 
density  are  necessities  in  this  application  while  relatively  high  Ron  can  be  tolerated, 
the  design  space  for  RF  and  logic  relays  are  drastically  different. 

To  date,  no  systematic  design,  optimization,  and  scaling  methodology  for 
logic  relays  has  been  proposed.  To  remedy  this  issue,  this  chapter  begins  with  a 
general  description  of  relay  structure  and  operation  in  Section  4.2,  followed  by 
contact  design  considerations  and  a  tungsten-based  relay  technology  discussed  in 
Section  4.3.  Section  4.4  then  presents  the  calibrated  relay  delay  and  energy  models 
which  are  used  to  develop  a  sensitivity-based  energy-delay  optimization  and 
scaling  methodology  in  Chapter  5. 

4.2  Relay  Structure  and  Operation 

In  terms  of  device  structure  and  operation,  a  logic  relay  is  very  similar  to 
larger  micrometer-scale  mechanical  switches  that  have  been  developed  for  radio 
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frequency  electronics.  Fig.  4.1  shows  the  schematic  3D  view  of  the 
electrostatically-actuated  relay  structure  and  indicates  several  important  relay 
features  and  design  parameters.  As  shown,  the  relay  is  comprised  of  a  movable 
actuation  plate  (the  “source”)  supported  at  the  four  corners  by  four  support  beams, 
each  of  which  is  anchored  to  the  substrate.  Folded-flexure  beams  are  purposely 
used  to  ensure  the  relay  design  is  robust  against  residual  thennal  stress  and  vertical 
strain  gradient  effects.  The  position  of  this  actuation  plate  depends  on  the  electric 
field  across  the  actuation  gap  (thickness  g)  between  the  gate  and  the  source. 


Contact  Dimples 


Fig.  4.1  A  schematic  3D  view  of  the  electrostatically-actuated  relay  structure. 


88 


—  Contact  Dimples  — 

' 

' 

l _ ^ 

— i  i — o 

t, - 1 

□  c 


□  c 


□ 


Le  “Pinned” 

(b)  On  State  (VGS  =  VDD) 


Fig.  4.2  ANSYS  simulated  displacement  contours  and  the  schematic  cross-sectional 
view  of  the  source  electrode  in  the  (a)  Off  state  and  (b)  On  state. 

In  the  off  state  (Fig.  4.2a),  an  air  gap  separates  the  source  from  the  metallic 
drain  electrode  so  that  no  current  can  flow.  In  the  on  state  (Fig.  4.2b)  where  the 
gate-to-source  voltage  is  greater  than  the  pull-in  voltage  ( Fpi),  the  actuation  plate 
comes  down  and  rests  upon  the  metal-coated  contact  dimples;  the  source  is  in 
contact  with  drain  electrode,  providing  a  conductive  path  for  current  to  flow.  Since 
the  relay  switches  on  abruptly  as  |  Fgb|  is  increased  above  Vp„  the  h-Vg  characteristic 
of  the  relay  exhibits  an  extremely  steep  (nearly  infinite)  subthreshold  slope.  Note 
that  since  electrostatic  force  is  ambipolar,  the  equivalent  of  an  n-channel  MOSFET 
(which  turns  on  when  a  positive  bias  voltage  is  applied  to  the  gate)  or  a  p-channel 
MOSFET  (which  turns  on  when  a  negative  bias  voltage  is  applied  to  the  gate)  can 
be  achieved  with  the  same  switch  design  by  appropriately  biasing  the  source 
electrode.  Thus,  electro-mechanical  switches  can  mimic  CMOSFETs  and  can  be 
used  to  implement  digital  logic  circuitry  accordingly. 
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Although  three  terminal  relays  are  discussed  in  this  section,  many  types  of 
micro-relays  have  been  described  in  the  literature,  with  variations  in  the  mechanical 
spring  design,  the  location  of  the  air-gap  actuator,  number  of  electrode  terminals 
[4.3]  and  the  orientation  of  beam  deflection.  While  this  paper,  unless  specified, 
mainly  focuses  on  the  three-tenninal  relay  design,  the  analysis  and  design 
techniques  presented  here  can  easily  be  extended  and  applied  to  any  specific  relay 
design. 

4.3  Reliable  Micro-Relay  Technology 


Despite  the  relay  ideal  switching  behavior  alluded  to  previously,  the 
principal  challenge  facing  micro-relay  designers  is  achieving  good  reliability  while 
maintaining  sufficiently  low  contact  resistance.  The  contact  resistance  is 
determined  by  material  properties  and  is  limited  by  asperities  on  the  contacting 
metallic  surfaces,  and  can  be  computed  by  the  following  equation  [4.10]: 


-  _L  (1+0-83*p/a\  P_ 


ftnN  —  _  ,  i 


(4.1) 


0N  3na2  '  \l+1.33Ap/aJ  2a 

where  a  is  the  radius  of  the  contact  asperities,  p  and  Ap  are  the  resistivity  and 
electron  mean  free  path  of  the  contact  material,  respectively. 

The  area  of  the  contact  asperities  Ax  is  a  function  of  the  material  hardness  H, 
the  deformation  coefficient  £,  at  the  contact  and  also  the  loading  force  Fc,  which  is 
approximately  the  electrostatic  force: 

Ar  —  na2  =  ^  (4.2) 

where  1  >c>0  and  c  is  inversely  proportional  to  the  loading  force. 
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Because  the  on-state  conductance  of  a  micro-relay  is  limited  by  asperities 
on  the  contacting  metallic  surfaces,  for  RF  relays  applications,  one  can  infer  from 
Eqns.  (4.1)  and  (4.2)  that  soft  contacting  electrode  materials  (esp.  gold)  and  a  large 
applied  load  to  plastically  deform  [4.1 1-4.14]  the  contacting  surfaces  ( i.e .  such  that 
the  metallic  material  liquefies)  are  preferred  in  order  to  minimize  asperities  and 
achieve  the  required  contact  resistance  (<lfi).  However,  this  makes  high  endurance 
very  difficult  to  achieve  with  such  designs  [4.13].  In  contrast,  for  logic  relays,  Ron 
can  be  as  high  as  10-lOOkQ  (for  load  capacitance  of  KMOOfF)  because  the 
switching  delay  of  a  relay-based  circuit  is  dominated  by  the  mechanical  pull-in 
time  (q>i,  typically  10-lOOns)  rather  than  the  electrical  RC  delay  (/rc)  [4.2].  Since 
extremely  high  endurance  is  a  necessity  in  this  application  while  relatively  high 
Ron  can  be  tolerated,  hard  contacting  electrode  materials  [4.15-4.17]  ( e.g .  tungsten) 
and  operation  with  low  contact  force  [4.15-4.17]  are  preferred.  In  addition,  a 
surface  treatment  can  be  applied  to  reduce  adhesion  [4.16,  4.18]  as  long  as  Ron  is 
not  increased  beyond  ~10kQ.  With  these  considerations  in  mind,  we  have 
developed  a  reliable  relay  technology  suitable  for  digital  logic  applications  using 
titanium  dioxide  (TiCF)  coated  tungsten  (W)  electrodes.  The  TiCF  coating  limits  the 
current  flow  at  the  contact  asperities,  and  the  relay  contact  can  therefore  better 
avoid  issues  with  contact  welding.  TiC>2  is  also  a  high  electron  affinity  dielectric 
(q%Ti02=4.2eV),  and  thus  presents  only  a  moderate  potential  barrier  to  electron 
conduction  from  W,  i.e.  it  degrades  on-state  conductance  least  among  common 
dielectric  materials. 
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Fig.  4.3  4  -mask  process  flow  used  to  fabricate  prototype  micro-relays  (A- A’  cross- 
section). 


Parameter 

Value 

Parameter 

Value 

Young  Modulus,  E 

145GPa 

Electrode  Length,  LE 

15pm 

Shear  Modulus,  G 

57GPa 

Truss  Width,  WT 

5pm 

Density 

4126kg  m  3 

Truss  Length,  LT 

12pm 

Beam  Width,  W 

5pm 

Beam  Thickness,  h 

1pm 

Beam  Length,  L 

{10,  ...,50}xpm 

Fabricated  Gap  Thickness,  g 

200nm 

Actuation  Plate  Width,  Wa 

30pm 

Dimple  Gap  thickness,  g^ 

lOOnm 

Actuation  Plate  Length,  LA 

27  pm 

Dimple  Area,  AD 

2x{4, 10,1 5,25}xpm2 

Table  4.1.  Relay  device  parameters 

Relays  with  parameters  shown  in  Table  I  were  fabricated  on  oxidized  Si 
wafer  substrates  as  follows.  Amorphous  silicon  (which  promotes  adhesion  of  W  to 
SiC>2)  and  W  layers,  each  50  nm  thick,  were  sequentially  deposited  by  sputtering  and 
then  patterned  to  form  the  drain  and  gate  electrodes  (Fig.  4.3a).  lOOnm-thick  LTO 
was  then  deposited  at  400°C  as  the  first  sacrificial  layer.  Contact  dimple  regions 
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were  then  fonned  by  optical  lithography  and  dry  etching  (Fig.  4.3b).  After  the 
deposition  of  a  2nd  lOOmn-thick  LTO  sacrificial  layer  (Fig.  4.3c),  a  50nm-thick  W 
layer  was  sputtered  and  patterned  to  form  metallic  contacting  electrodes  (Fig.  4.3d). 
Next,  a  1  pm-thick  structural  layer  of  in-situ  boron-doped  polycrystalline  Si0.4Geo.6 
was  deposited  at  410°C  [4.19].  The  Sio.4Geo.6  layer  was  then  patterned  (Fig.  4.3e) 
and  the  structures  were  released  (Fig.  4.3f)  with  a  timed  isotropic  oxide  etch  using 
vapor  49%  hydrofluoric  acid  at  27°C.  Immediately  afterwards,  the  entire  relay 
structure  was  coated  with  TiCF  at  275°C  using  12  cycles  of  atomic  layer  deposition 
(ALD).  One  ALD  cycle  consists  of  one  pulse  of  titanium  tetrachloride  (TiC’U) 
followed  by  Ti  oxidation,  and  deposits  -0.25A  of  Ti02.  Note  that  since  the 
maximum  process  temperature  is  410°C,  this  relay  technology  is  suitable  for 
fabrication  over  CMOS  circuitry  [4.20]  or  on  glass  substrates. 


Fig.  4.4  Plan  view  scanning  electron  micrograph  of  a  relay  before  the  release  step. 
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4.4  Results  and  Discussion 


Several  logic  relays  with  beam  length  L  in  the  range  from  10  to  50fnn  were 
fabricated  using  the  process  flow  detailed  in  Section  4.3;  Fig.  4.4  presents  a  plan- 
view  scanning  electron  micrograph  (SEM)  of  a  prototype  L  =20um  relay.  All 
measured  relays  (>200)  were  found  to  be  functional.  Before  actual  measurement, 
each  fresh  device  is  first  electrically  “burned  in”  by  switching  it  on/off  with  a 
relatively  high  drain-to-source  voltage  (Fds=1~2.5V)  several  times  [4.21].  This  will 
break  the  native  tungsten  oxide  and  other  contaminants  on  the  contacting  surfaces 
to  obtain  a  clean  and  stable  contact  resistance.  Measured  /d-Fgs  and  /d-Fds 
characteristics  of  a  L= 40pm  relay  are  plotted  in  Fig.  4.5.  As  expected,  the  relays 
have  zero  off  state  drain-to-source  and  gate  leakage  currents.  Abrupt  switching  is 
seen  at  Fgs=5.35V  and  RoN=8.1kQ;  the  measured  Ron  matches  well  with  the  contact 
resistance  model,  which  predicts  Ron  ~10kQ  for  Fc=0.89pN  at  Fgs=5.35V,  using 
TiCF  resistivity  of  0.26Q-cm,  tungsten  hardness  of  l.IGPa  [4.22],  /.p=33nm  and 
£=0.3  [4.22].  As  depicted  in  Fig.  4.6,  the  relay  can  endure  1.25  billion  on/off  “hot” 
switching  cycles  with  Fds=1V  in  N2  ambient  without  stiction  or  welding-induced 
failure,  making  tungsten  an  attractive  candidate  for  reliable  relay  contact  materials. 

With  a  pathway  to  enable  reliable  micro-relay  for  digital  logic  applications 
experimentally  demonstrated,  the  following  section  aims  to  develop  and  calibrate 
relevant  models  and  analytical  formulations  for  relay  performance  (e.g  pull- 
in/release  voltages,  switching  speed  and  energy)  in  order  to  facilitate  design 
optimization  for  relay-based  circuits. 
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4.4.1  Static  Performance  Analysis 


a.  Pull-in  and  Release  Voltages 

Since  the  relay  is  operated  in  the  pull-in  mode  (gd>g/3),  hysteretic  switching 
behavior  is  observed  as  indicated  by  Fig.  4.5.  Analytically,  the  pull-in  voltage  Vn, 
and  release  voltage  V,\  can  be  calculated  as: 


where  £eff  is  the  effective  spring  constant  of  the  folded-beam,  A  is  the  actuation  area 
:  A  =  Ze  x  W a  and  Fa  is  the  surface  adhesion  force.  Note  that  in  Eqn.  4.3,  non-ideal 
effects  such  as  fringing  capacitance  and  actuation  area  reduction  due  to  the  release 
holes  are  assumed  to  be  negligible. 


95 


1.E-04 


1.E-06 
g  1.E-08 

C/5 

_Q 

1.E-10 

1.E-12 

1.E-14 

0  1  2  3  4  5  6  7 


VGS  [V] 


(a) 


VDS  [V] 


(b) 

Fig.  4.5(a)  Measured  /D-  Tgs  and  (b)  /D-  FDs  characteristics  for  a  relay  with  L  =40  mm. 
The  two  drains  are  externally  connected  . 
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Fig.  4.6  Measured  contact  resistance  vs.  number  of  on/off  hot  switching  cycles  (at 
Fds=1  V  to  mimic  scaled-relay  operation).  Due  to  the  formation  of  tungsten  native 
oxide,  contact  resistance  is  unstable. 


Serpentine  z  displacement  [nm] 


Fig.  4.7  (a)  ANSYS-simulated  displacement  contours  of  the  source  electrode  in  the 
on-state,  (b)  the  spring  exhibits  both  bending  and  torsional  motions  when  the 
actuation  pad  moves. 
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As  indicated  by  the  ANSYS-simulated  displacement  contours  of  the 
actuation  plate  shown  in  Fig.  4.7,  the  spring  exhibits  both  bending  and  rotational 
motions  when  the  plate  moves.  The  exact  Air  model  that  accounts  for  shear 
displacements  and  rotary  inertias  is  rather  complicated  [4.23];  by  sacrificing  some 
degree  of  accuracy,  Aff  can  be  rendered  into  a  more  intuitive  fonn,  which  consists  of 
flexural  (ocl/Z3)  and  torsional  (ocl/Z)  [4.24]  terms: 


1  f  EWh3\  1  ,  f  GWh3\ 

+(n— j 


-l 


(4.4) 


ceff  ^ 

where  yf  and  yt  are  the  flexural  and  torsional  constants.  By  using  ANSYS,  yf  and  yt 

10  2 

are  found  to  be  3.66  and  1.341x10  in'"  respectively.  As  indicated  in  Fig.  4.8,  the 
analytical  model  is  within  10%  of  ANSYS  simulation;  Eqn.  4.3  predicts  Vpi  values 
within  10%  of  the  measured  data,  as  shown  in  Fig.  4.9 

To  ensure  that  the  relay  can  be  turned  off,  i.e.  F,i>0,  the  spring  restoring  force 
must  be  sufficient  to  overcome  surface  adhesion  force,  FA.  FA  is  dominated  by  the 
metal-to-metal  contacts  and  it  can  be  estimated  by  expressing  Vr\  as  a  function  of  Vpi: 

(4.5) 

rL  4  g  V  g  J  Pl  enA  A  v  ’ 


The  average  FA  is  extracted  to  be  0.45pN  for  a  dimple  area  (AD)  of  2x  1 0pm“,  with 
individual  relays’  extracted  FA  varying  by  ±1  pN  around  this  value,  as  indicated  in 
Fig.  4.10.  The  extracted  FA  value  matches  well  with  results  obtained  by  atomic 
force  microscopy  [4.25].  Using  the  extract  FA  value,  Eqn.  4.3  predicts  Vv\  values 
within  10%  of  the  measured  data,  as  shown  in  Fig.  4.1 1. 


98 


Fig.  4.8  Modeled  ke ff  values  match  ANSYS  simulation.  As  L  decreases,  torsional 
motion  dominates. 


L[H 


Fig.  4.9  Measured  relay  Vpi  vs.  L.  Vpi  decreases  as  L  increases,  as  expected.  The 
measured  data  is  within  10%  of  the  analytical  model 
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Vpi2[V2] 


Fig.  4.10  Vr2  vs.  Vpi2  for  the  relays  of  Fig.  4.9.  FA  is  extracted  to  be  0.45|JN. 


L[|am] 

Fig.  4.11  Measured  relay  Vx\  vs.  L.  The  measured  data  is  within  10%  of  the  analytical 
model. 
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Finally,  as  previously  alluded  to,  the  serpentine  spring  design  ensures  that 
the  relay  is  robust  against  thermal  stress  effects  through  expansion;  therefore,  the 
measured  values  of  Vp„  Vr\  and  Ron  do  not  significantly  change  with  temperature, 
as  shown  in  Fig.  4.12  and  4.13.  The  negative  Young’s  modulus  temperature 
coefficient  account  for  the  slight  reduction  in  Fpi  and  Vr\  values. 
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Fig.  4.12  a)  Measured  Vpj  and  Vr\  vs.  T  for  L=1 7pm.  (b)  ANSYS  shows  that  the 
folded  spring  design  releases  the  thermaFresidual  stress  by  expansion. 
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Temperature  [°C] 

Fig.  4.13  Measured  contact  resistance  vs.  temperature  for  a  L=40|Jm  relay, 
b.  Dimple  Support  Design 

When  the  relay  is  turned  on,  the  actuation  plate  comes  down  and  rests  upon 
the  contact  dimples  (Fig.  4.2b);  proper  relay  design  involves  not  only  the  choice  of 
relay  device  dimensions  that  give  the  desired  pull-in  and  release  voltages,  but  also 
includes  device  dimensions  that  prevent  the  actuation  pads  from  catastrophically 
pulling  in  and  shorting  the  electrodes.  To  determine  the  catastrophic  pull-in  voltage 
(Fcpi)  value,  exact  information  for  the  contact  dimples  (e.g.  surface  roughness, 
contact  friction,  etc.)  is  needed,  which  is  complicated.  Fortunately,  sacrificing  some 
degree  of  accuracy,  a  conservative  Vcpi  estimation  can  once  again  be  obtained  by 
modeling  the  actuation  pad  as  a  simply-supported  beam  pinned  at  its  two  dimples, 
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as  indicated  in  Fig.  4.2b.  The  bending  of  the  actuation  plate  can  be  estimated  by  the 
Euler-Bernoulli  equation  [4.24]: 


£qWaV2 

2  (g-ga-z*)2 

0 


in  actuation  region 
otherwise 


(4.6) 


where  v  is  the  Poisson  ratio  of  the  structural  material.  E!{  1  -v2)  instead  of  the  Young 


Modulus  E  is  used  to  account  for  the  plate  effects;  Ia  is  the  moment  of  inertia 
=WAhi/ 12,  and  z*  =  z  -  gd. 


Eqn.  (4.6)  is  solved  with  the  appropriate  boundary  conditions  at  both  dimple  ends: 


z*(x  —  0)  —  z*(x  —  La )  -  0  ; 


d2z* 


dx2 


x=0 


d2z* 


dx2 


=  0 


x=La 


(4.7) 


which  gives  the  analytical  fonnulation  for  Vcpi  : 

t j  ~  i  cir  \Eh3ig-ga )3 
V‘V‘  =  15 16) 


(4.8) 


Eqn.  4.8  predicts  a  Vcpi  value  of  14.6V;  ANSYS  simulation  gives  Fcpi=18.9V.  As 
previously  alluded  to,  Eqn.  4.8  is  a  conservative  estimate  of  Fcpi;  therefore,  the 
calculated  Fpcj  is  lower  than  the  measured  value,  which  is  23V. 


4.4.2  Dynamic  Performance  Analysis 


In  addition  to  the  static  voltages,  circuit  designers  are  particularly  interested 
in  the  mechanical  delay  time  as  it  determines  the  circuit  performance.  When  a  bias 
voltage  (TO  is  applied  between  gate  and  source  electrodes,  the  motion  of  the 
actuation  plate  is  governed  by  Newton’s  Second  Law  of  Motion,  which  yields  the 
following  second  order  differential  equation  [4.26]: 

..  Vfce//"le//  . 


meffZ  +  Q 


-z  +  kPffZ  = - — 

eJt  2  (5-z)2 


(4.9) 
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The  right  side  of  the  equation  is  the  electrostatic  force,  Q  is  the  quality  factor;  z  is 
the  actuation  plate  displacement,  and  meff  is  the  effective  mass  of  transport. 

To  develop  a  rigorous  delay  model,  it  is  worthwhile  to  spend  some  effort  in 
deriving  an  accurate  analytical  fonnulation  to  the  effective  mass.  To  reach  this 
goal,  we  first  note  that  the  effective  mass  consists  of  the  mass  of  the  actuation  plate 
and  the  loaded  mass  from  the  springs,  and  can  be  determined  from  the  total  kinetic 
energy  in  the  relay  ( KEXoi )  and  the  velocity  of  the  actuation  plate  (vp)  [4.27]: 

4P""• 


_  KE,„t  _  V,,^+>r'f+;/ 'T'"").  _  |  . 

"Ipff  —  i  o  —  - 13 -  —  Trip  T  VTit  ^  T 


Leff  ~  —V2 

„  17  n 


[zb(y)l2 


2  2  ^ 

(4.10) 

where  m’s  and  v’s  are  the  masses  and  velocities,  and  the  subscripts  p,  t  and  b 
denote  the  actuation  plate,  trusses  and  the  folded  beams,  respectively.  The  trusses 
are  displaced  by  a  distance  z/2,  and  therefore, 

(4.11) 

Zb(y)  is  the  beam  deflection.  For  beam  AB,  as  depicted  in  Fig.  4.14, 

z»w  =  z[!(r)2-(r)]  <412» 


1 

2  'Vv 


And  for  beam  CD 


Zb  (y)  =  ^ 


(4.13) 


Substituting  Eqns.  (4.1 1)-(4.13)  into  (4.10),  zneff  is  given  by: 


meff  =  mp+^mt  +  ^mb 


(4.14) 
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Substitute  the  plate,  truss  and  beam  dimensions  into  Eqn.  (4.14),  me s  can  be 
simplified  and  expressed  by: 

meff  —  a0pAh  +  a1pWLh  (4.15) 

where  a0=1.93  and  ai  =2.74;  ANSYS  simulation  predicts  a0=1.43and  ai=3.83. 


xx 


Truss 


Fig.  4.14.  The  effective  mass  of  the  folded  spring  can  be  estimated  by  the  beam 
kinetic  energy,  which  is  detennined  by  the  mode  shape  of  segment  AB  and  CD. 


Substituting  Eqn.  (4.4)  and  Eqn.  (4.15)  into  Eqn.  (4.9),  the  plate  position  is 
obtained  by  solving  the  resultant  equation.  But  in  order  to  provide  insight  for  relay 
design,  we  analytically  approximate  /delay  by  the  following  expression: 

tdelay  =  (fj  -  x)  "  for  SVpi  >  Vdd  >  l.lVpi ,  gd  >  g/3 

(4.16) 
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where  %=  0.8;  a  ,  /?  and  y  ,  to  the  first  order,  depend  only  on  Q  and  their  values  are 
plotted  in  Fig.  4.15.  The  details  of  the  derivation  are  discussed  in  Appendix  I.  Over 
the  range  of  interest,  Eqn.  (4.16)  predicts  /delay  values  within  20%  of  Eqn.  (4.9),  and 
the  accuracy  improves  as  Fdd/Fpi  increases.  Note  that  Eqn.  4.16  shows  that  /delay  not 
only  depends  on  the  relay  resonant  frequency ^ kef f  / mef  f  ,  but  also  on  the  Fdd/Fpi 
ratio,  which  from  now  on  we  denote  it  as  the  “gate  overdrive”. 


Fig.  4.15a.  Dependence  of  a  on  quality  factor,  for  5Vpi>  Fdd>  1-1  VPi,  gd>g/3 
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Fig.  4.16  (a)  Relay  delay  measurement  setup.  Measurements  were  made  in  N2 
ambient  (b)  The  total  delay  fatal  can  be  extracted  from  the  difference  between  the 
input  and  output  signals,  (c)  faiay=  fatal  -fable,  where  /cable  is  the  electrical  delay  of  the 
cable. 

To  calibrate  the  delay  model,  relay  delay  measurements  were  taken  in 
nitrogen  ambient  using  the  test  setup  shown  in  Fig.  4.16.  As  shown  in  the  figure,  a 
load  resistor  is  utilized  in  this  measurement  setup  to  allow  for  application  of  a  step 
voltage  to  the  gate  electrode  and  measure  to  trigger  a  voltage  drop  (delayed  by  fatal) 
at  the  drain  electrode  (Fig.  4.16b).  To  separate  the  cable  delay  (fable)  from  the  pull- 
in  time,  fable  is  first  measured  using  an  on-chip,  dummy  metal  line  (Fig.  4.16c); 
/delay  can  then  be  extracted  from  the  difference  between  fatal  and  fable-  Utilizing  this 
measurement  setup,  Fig.  4.17  shows  the  measured  delay  of  three  relays  with 
L=  14pm,  40pm  and  50pm  at  different  faid  values,  faiay  decreases  with  increasing 
Fdd  and  I /farr ,  as  expected.  Note  that  no  contact  bounce  is  observed;  measured  faiay 
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values  match  predictions  using  Eqn.  4.9  within  20%,  with  an  extracted  quality 
factor  value  Q  =  0.3. 

Finally,  since  the  relay  is  purely  electrostatically  actuated,  all  of  the  energy 
consumed  by  switching  it  on  and  off  is  supplied  by  the  voltage  source  driving  the 
gate  electrode  and  any  wiring  and  load  capacitance  at  the  drain  electrode.  The  total 
energy  consumed  in  switching  the  relay  on  and  off  is  simply  set  by  the  charge 


supplied  by  the  supply  voltage: 


(4.17) 


where  Cl  is  extrinsic  load  capacitance.  With  a  reliable  relay  technology,  together 


with  accurate  static  and  dynamic  models  properly  developed,  we  are  now  ready  to 


optimize  relay  based  digital  circuits  and  project  the  performance  of  scaled  relays. 


These  topics  will  be  addressed  in  Chapter  5. 


Vdd  [V] 


Fig.  4.17  Measured  fa  vs.  FDd  for  three  different  relays. 
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4.5  Conclusion 


A  pathway  to  enable  reliable  micro-relays  for  digital  logic  applications 
through  proper  contact  design  is  proposed  and  demonstrated  with  TiC>2-coated  W 
contacting  electrodes.  Prototype  relays  fabricated  utilizing  a  CMOS-compatible, 
poly-Sio.4Geo.6  surface  micromachining  process  have  been  demonstrated  to  operate 
at  near  5V  supply  voltage  with  reasonable  (~1  OkQ)  contact  resistance  over  a  wide 
temperature  range  (20  °C  -200°C).  The  devices  have  low  surface  adhesion  energy 
(~4pJ/m  )  and  can  endure  more  than  10  on/off  hot-switching  cycles  in  N2  ambient 
without  stiction-  or  welding-induced  failure.  In  order  to  facilitate  design 
optimization  and  scaling  for  relay-based  circuits,  relevant  models  and  analytical 
formulations  for  relay  performance  are  also  developed  in  this  chapter.  It  should  be 
note  that  the  relatively  high  supply  voltages  demonstrated  in  this  chapter  do  not 
necessarily  represent  the  ultimate  voltage  limit  for  relay  technology.  Through 
proper  relay  energy-delay  optimization  and  device  scaling,  scaled  relays  will  offer 
substantial  reduction  in  supply  voltage  and  improvements  in  energy  efficiency  over 
CMOS.  These  topics  will  be  addressed  in  chapter  5. 
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Chapter  5 


Optimization  and  Scaling  of 
Micro-Relays  for  Logic 
Applications 

5.1  Introduction 

With  a  pathway  to  reliable  micro-relay  for  digital  logic  applications 
experimentally  demonstrated  in  Chapter  4,  the  focus  of  this  Chapter  is  on  achieving 
fast,  energy-efficient,  and  compact  relay-based  digital  circuits.  To  date,  no 
systematic  optimization  and  scaling  methodology  for  logic  relays  has  been 
proposed.  To  remedy  this  issue,  this  Chapter  begins  with  a  sensitivity-based 
energy-delay  optimization  methodology  in  Section  5.2,  allowing  the  establishment 
of  simple  relay  design  guidelines.  Based  upon  these  guidelines,  together  with  the 
measured  adhesion  force  scaling  with  contact  dimple  area,  we  then  propose  a 
scaling  methodology  for  micro-relays  in  Section  5.3,  which  leads  to  systematic 
improvements  in  device  density,  perfonnance,  and  energy  consumption. 
Simulation  results  indicate  that  scaled  relay  technology  may  offer  >10x 
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improvement  in  energy  efficiency  for  applications  requiring  perfonnance  up  to 
~100MHz.  Finally,  section  5.4  concludes  this  chapter  . 

5.2  Relay  Energy-Delay  Optimization 


Having  developed  the  relevant  relay  energy-delay  models  in  Chapter  4,  the 
focus  of  this  chapter  is  now  on  optimizing  relay-based  circuits.  As  was  described  in 
[5.1]-[5.3],  the  optimal  topology  for  relay  circuits  is  drastically  different  from  that 
for  CMOS  circuits.  Specifically,  because  the  delay  of  relay-based  circuits  is 
dominated  by  the  mechanical  delay  rather  than  the  electrical  RC  delay,  an 
optimized  relay-based  circuit  should  (as  shown  in  Fig.  5.1)  consist  of  single  stage 
complex  gates  [5.2,  5.4]  such  that  the  mechanical  motions  of  all  the  relays  in  the 
circuit  occur  simultaneously1. 

In  optimizing  the  relay  design  for  the  appropriate  circuit  topology,  it  is 
important  to  note  that,  as  for  CMOS  circuit  design  [5.5-  5.10],  relay  energy  and 
performance  trade  off  against  each  other.  The  goal  of  the  optimization  is  to 
minimize  relay  circuit  delay  subject  to  a  given  energy  budget,  which  in  essence 
boils  down  to  solving  the  following  constrained  optimization  problem  for  an  N- 
relay  stack: 


Minimize:  tdeiay  =  a 


-P 


Subject  to:  Etot  =  If=1  +  CL  i)  Vjd 

\y  yd.  ' 


(5.1) 


3  Note  that  this  only  work  for  four-terminal  relays. 
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«dT“ 

Input  1  to  N 

r 


N  Inputs 

_ i _ 


_!_  _L  _L  -L  _L  ' 

Vdd  J  ^  f  ^  f  ^  Output 


(a) 


CL  CLJ  CL>3^"  Cl.jmT  Cln"^ 


(b)  Vdd 


Input 

-L  A=Atot/N 


CL=Ctot/N  ^ 


Fig.  5.1  (a)  Optimal  relay  circuit  topology,  (b)  The  energy-delay  optimization 
problem  can  be  simplified  to  optimizing  one  single  relay  driving  the  average 
capacitance  Ctot/N 

where  Etot  is  the  total  energy;  A,  and  Cu  are  the  area  and  the  load  capacitance, 
respectively,  of  the  z-th  relay  in  the  stack.  Assuming  all  relays  have  the  same 
fabricated  gap  thickness  g  and  dimple  gap  thickness  gd,  the  total  energy  can  be 
rewritten  as: 


Etot  =  +  Cut)  VL  (5.2) 

where  Atot  and  Ctot  are  the  total  actuation  area  and  load  capacitance,  respectively. 
Given  that  the  electrical  delay  is  negligible  when  compared  against  the  mechanical 
delay,  then  by  symmetry,  an  optimized  relay  circuit  design  would  size  all  relays 
identically  so  that  the  devices  have  the  same  switching  delay  and  energy 
consumption: 
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(5.3) 


Es  =  ^  =  (—  +  CL)  K|d  where  Atot  =  N  x  A,  CL=Ctot  /N 

Therefore,  the  energy-delay  optimization  problem  is  now  reduced  to  optimizing  the 
energy-delay  of  a  single  relay  driving  one  capacitive  load  Cl,  where  Cl  is  the 
average  capacitance  driven  by  each  relay.  To  solve  this  constraint  optimization 
problem,  we  first  apply  the  sensitivity-based  analysis  to  explore  the  relay  energy- 
delay  tradeoff. 

5.2.1  Sensitivity  analysis 

Sensitivity-based  analysis  for  CMOS  circuit  optimization  has  been  extensively 
explored  in  [5.5-5.10]  to  explore  digital  integrated  circuit  energy-delay 
optimization.  The  optimization  can  equally  applied  to  optimize  relay  circuits, 
therefore  the  key  concepts  are  here  briefly  reviewed.  The  energy-delay  sensitivity 
of  a  given  tuning  variable  var  is  defined  as: 

^  (5.4) 

var  dEs/dvar  v  ’ 

which  is  interpreted  as  the  delay  reduction  per  energy  cost  by  adjusting  variable 
var.  For  most  relay  designs,  the  beam  width  W  is  the  minimum  feature  size  set  by 
photolithography;  thickness  h  and  the  minimum  dimple  gap  thickness  gd  are  set  by 
process  constraints.  Once  the  thickness  is  fixed,  the  quality  factor  and  therefore  the 
a,  p  and  y  values  are  known.  Therefore,  the  supply  voltage  Cid,  actuation  area  A, 
fabricated  gap  thickness  g  and  the  beam  length  L  are  the  available  design  variables 
for  optimization.  By  adjusting  these  variables,  the  optimal  relay  design  is  reached 
when  the  sensitivities  to  all  tuning  variables  are  balanced  [5.5,  5.6,  5.9].  With  these 
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goals  in  mind,  we  herein  derive  the  analytical  formulations  for  sensitivities  to  these 
variables. 

i.  Sensitivity  to  Supply  Voltage 

We  begin  our  analysis  by  exploring  the  sensitivity  of  delay  to  energy  due  to 
the  change  in  Fid-  As  Fki  increases,  the  switching  delay  decreases  because  the 
electrostatic  force  increases.  Of  course,  the  switching  energy  also  increases  with 
Fdd.  With  the  power-law  dependences  of  both  delay  and  switching  energy  on  Fid, 
the  resultant  negative  sensitivity  to  supply  voltage  is  given  by  Eqn.  5.5: 

dtdelay/dVddL  _  _  fit  delay  ^  ( Fid\  /  / v  dd  _  A 
dEs/dVdd  2ES  \Vpi)l  \Vpi  ) 

Where  /delay  and  Es  are  respectively  the  nominal  delay  and  energy  at  a  given  Fdd. 

Note  that,  as  will  be  discussed  later,  typical  a  Fid/  Fpi  value  lies  within  the  range 

£ 

1.5-3.  Therefore  the  normalized  sensitivity  [5.11],  which  is  defined  as  — —  x 

tdelay 

dt^ aJdv  Vdd’  'S  rou8h|y  —(0-7- 1 . l)xp.  Hence,  as  a  simple  rule  of  thumb,  every  2X 


energy  increase  can  be  sacrificed  for  ~1.6-1.8X  reduction  in  relay  delay  by  Fid 
adjustment. 

ii.  Sensitivity  to  Actuation  Area 

In  addition  to  Fid  adjustment,  relay  sizing  is  also  an  effective  means  to 
adjust  the  tradeoff  between  energy  and  delay.  The  sensitivity  of  delay  to  energy  due 
to  the  change  in  the  actuation  area  is  given  by  Eqn.  5.6: 


dtdelay/dA  _  tdelay 

dEs/dA  ~  2  Es 


a0phA 


L  meff 


KWfe-f)  K1  +  C4^))  C5.6) 
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As  the  actuation  area  A  increases,  the  on-state  capacitance  and  therefore  the 
switching  energy  increases.  On  the  other  hand,  increasing  A  has  a  two-fold  impact 
on  the  relay  switching  speed:  the  pull-in  time  decreases  due  to  the  reduction  in  Vpi; 
but  the  increase  in  the  actuation  mass  counteracts  this  effect.  For  high-Q  relays  (i.e. 
small  P),  as  will  be  discussed  later,  the  resonant  frequency  term  dominates 
switching  delay.  Therefore,  reducing  the  actuation  area  is  attractive  because  it 
decreases  both  the  switching  delay  and  energy,  leading  to  a  positive  sensitivity.  But 
it  is  important  to  note  that  the  sensitivity  diminishes  with  reduction  in  actuation 
area  because,  the  gate  overdrive  decreases  and  the  spring-loaded  mass 

becomes  increasingly  significant  in  determining  the  resonant  frequency.  All  these 
eventually  lead  to  a  negative  sensitivity. 

iii.  Sensitivity  to  Fabricated  Gap  Thickness 

In  designing  micro-relays,  it  is  desirable  to  use  use  the  thinnest  dimple  gap 
(gd)  to  minimize  the  travelling  distance  of  the  actuation  plate.  A  thinner  fabricated 
actuation  gap  (g)  also  provides  for  larger  actuation  force.  However,  it  is  important 
to  note  that  reducing  the  fabricated  gap  thickness  increases  the  on-state  capacitance 
and  therefore  the  switching  energy.  This  results  in  a  negative  sensitivity  to  g,  as 
shown  by  Eqn.  5.7: 


dtdelay/dd 

dEs/dg 


t delay  ^  p  _  ^ 


1 +  CL 


iv.  Sensitivity  to  Beam  Length 


Finally,  if  the  beams  do  not  contribute  substantial  capacitance,  then  to  first 


order,  relay  switching  energy  is  independent  of  the  beam  length.  Therefore,  the 
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beam  length  can  be  optimized  to  maximize  the  switching  speed  with  no  energy 
penalty.  Before  doing  so,  it  is  important  note  that  the  pull-in  time  depends  not  only 
on  the  resonant  frequency,  but  also  on  the  gate  overdrive.  Therefore,  increasing  the 
beam  length  has  a  dual  impact  on  the  relay  switching  speed:  the  pull-in  time 
decreases  due  to  the  reduction  in  Vpi,  but  the  increase  in  the  loaded  mass 
counteracts  this  effect.  Therefore,  there  exists  an  optimal  beam  length  that  properly 
balances  the  resonant  frequency  and  the  gate  overdrive: 


dt  delay 
dL 


=  o  =>  ><  [i  ' r 

IVo  dL  )'  \vpi  dL  )\  A  \vpi) 


=  P 


(5.8) 


where  the  left-hand  side  if  the  equation  is  proportional  to  /«effx[l-x(fdd/Tpi)"1]- 
Therefore,  for  high -Q  relays  (i.e.  low  [1  value),  short  beams  are  preferred  to 
decrease  the  mass  and  increase  the  relay  resonant  frequency.  For  low-0  relay,  on 
the  other  hand,  one  would  prefer  to  use  long  beams  to  decrease  Vpi  and  therefore 
increase  the  gate  overdrive. 


5.2.2  Relay  Design  Optimization 


With  analytical  expressions  for  the  sensitivities,  the  relay  design  can  now  be 
optimized.  As  previously  alluded  to,  this  goal  can  be  reached  by  balancing  the 
sensitivities  of  all  of  the  design  variables.  This  means  that  we  can  use  the 
nonnalized  sensitivity  to  supply  voltage  (-(0.7-1 .  l)xp),  to  pick  all  other  design 
variables.  It  also  implies  that  for  an  optimized  relay,  every  2x  energy  increase  can 
be  sacrificed  for  ~1.6-1.8x  reduction  in  relay  delay  by  changing  any  design 
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variable.  Using  this  result,  simple  guidelines  for  energy-efficient  relay  design  are 
established  in  this  section. 


i.  Optimal  Gap  Thickness  Ratio 

We  begin  the  optimization  process  by  exploring  the  optimal  gjg  ratio.  By 
balancing  the  sensitivities  to  V({<i  and  to  g, 

-l<p  =  -(i-f)a+n  {-y  +  \v)  (5.9) 

where  (p  —  /?  f— )  / f—  —  x\  f  ~  Q/(  ~  )  is  the  fan-out  of  the  relay.  Since 
\VpiJ/  VTpi  )  V7  9d  ' 

y~0.3  and  P~l,  y«1.5cp,  the  optimal  gd/g  ratio  is  approximated  by  Eqn.  5.10: 


gd  _  2+3/ 
g  ~  3  +  3/ 


(5.10) 


Therefore  the  optimal  gd/g  ratio  is  roughly  0.66-0.75  across  a  large  fan-out  range, 


and  this  value  is  largely  independent  of  the  quality  factor  value  (P).  This  implies 


that  pull-in  operation  is  preferred  for  optimal  energy  efficiency.  These  results  are 


consistent  with  the  simulation  results  shown  in  Fig.  5.2. 
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1.E-14  1.E-13 

Load  Capacitance  CL  [F] 

Fig.  5.2.  The  optimal  relay  gjg  ratio  stays  roughly  at  2/3,  i.e.  pull-in  operation  is 
preferred,  for  a  wide  range  of  design  parameters. 


ii.  Optimal  Actuation  Area  and  Supply  Voltage 

Once  the  gjg  ratio  is  known,  the  relay  can  be  sized  optimally  if  the  optimal 
relay  fan-out  is  known.  Although  the  exact  analytical  formulation  for  optimal  fan¬ 
out  is  complex,  since  mechanical  delay  dominates,  the  effect  of  the  relay  sizing  on 
the  RC  delay  of  the  gate  input  signal  is  negligible.  As  a  result,  all  that  is  left  is  the 
quadratic  dependence  of  energy  on  Fdd  vs.  the  linear  dependence  on  gate 
capacitance.  If  the  relay  has  to  drive  a  load  capacitance  under  a  given  energy 
constraint,  it  would  be  preferable  to  upsize  the  relay  and  lower  Fdd  to  reduce  the 
energy  spent  on  the  load  capacitance.  In  doing  so,  the  pull-in  voltage  (Fp;  oc  A  ) 


drops  at  a  faster  rate  than  the  supply  voltage  ( Vdd  oc 


oc  (1  +  A)  0,5  ),  and 
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results  in  an  increase  in  the  gate  overdrive.  Together  with  the  fact  that  delay  is 
relatively  insensitive  to  sizing  (tdeiayrcA(1'w/2~A01),  it  can  be  concluded  that  it  is 
worthwhile  to  upsize  the  relay  gate  in  order  to  allow  for  an  overall  reduction  in 
delay.  Therefore  the  optimal  fan-out  is  less  than  one,  as  shown  in  Fig.  5.3. 

Once  the  optimal  values  for  the  fabricated  gap  thickness  and  actuation  area 
are  obtained,  the  optimal  Fid  is  simply  set  by  the  energy  constraint.  The  upper 
bound  for  Fdd  value  is  the  catastrophic  pull-in  voltage. 


1.E-14  1.E-13 

Load  Capacitance  CL  [F] 

Fig.  5.3.  Optimal  fan-out  for  a  relay  for  a  range  of  design  parameters. 

iii.  Optimal  Fdd/  Fpi  and  beam  length 

As  previously  alluded  to,  the  beam  length  dictates  the  balance  between  the 
relay  gate  overdrive  and  the  resonant  frequency.  It  can  be  shown  at  the  optimal 
beam  length,  the  gate  overdrive  and  the  resonant  frequency  are  balanced  when 
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(5.11) 


.  ( .  a0phA\ 

(p  —  1  —  K  1 - - - 

V  meff  ) 

Where  cp  —  /?  j  X^j  and  k  =  ■  K  's  roughly  -1/3  for 

most  relay  designs. 

To  compute  the  optimal  gate  over  drive,  once  needs  to  know  aphA/m^ 
ratio.  Optimal  aphA/meff  ratio  can  be  obtained  by  balancing  the  sensitivities  to  area 
and  to  fabricated  gap  thickness: 

» mm -4-m )h+3imm-x)) 


L  meff 


Since  the  optimal  gjg  value  is  roughly  0.7  and  y~0.3  is  negligible,  the 


optimal  aphA/meff  ratio  is  proportional  to  cp,  as  shown  in  Eqn.  5.12: 
=  (p  (—2  +  3—)  =  O.l^J 

meff  V  g  J 


(5.12) 


Substituting  Eqn.  5.12  into  Eqn.  5.11  the  optimal  gate  overdrive  can  be  obtained: 

/?fe)/fe“/)  =  <?,  =  ^*1-29  (5-13) 


For  low-0  relays  with  [!  >  0.8,  V(\J Epi»2  and  therefore  long  beams  are 
preferred.  However,  it  is  important  to  keep  in  mind  that  the  longest  beam  length 
will  be  set  by  the  surface  adhesion  energy,  or  by  layout  area  constrains.  On  the 
other  hand,  for  high-0  relays  with  [3-0.6,  the  optimal  VaJVv,  ratio  lies  within  the 
range  1.5-2.  Therefore,  shorter  beams  are  preferred.  These  results  are  consistent 
with  the  simulation  results  shown  in  Fig.  5.4  and  Fig.  5.5. 
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1.E-14  1.E-13 

Load  Capacitance  CL  [F] 

Fig.  5.4  The  optimal  gate  overdrive  (  fW Fpi)  value  for  high  and  low-Q  relays. 


1.E-14  1.E-13 

Load  Capacitance  CL  [F] 

Fig.  5.5  The  optimal  relay  beam  length  is  chosen  to  achieve  the  required  gate 
overdrive  (Fig.  5.4);  low-0  relays  have  higher  gate  overdrive  and  therefore  longer 
beams  are  preferred.  Note  that  we  assume  the  longest  beam  length  is  50|Jm  in  this 
study. 
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5.2.3  Relay  Optimization  Example 


Simple  guidelines  for  designing  energy-efficient  relays  have  already  been 
established  in  this  section.  As  a  concrete  example,  the  previously  described 
methodology  is  used  to  optimize  a  5  pm  wide  relay  (with  parameters  shown  in 
Table  I).  The  relay  delay  is  optimized  for  fss=2pj  with  total  load  capacitance 
ranging  from  lOfF  to  lOOfF,  and  the  results  are  shown  in  Fig.  5.6.  As  expected,  the 
delay  increases  with  increasing  load  capacitance.  In  addition,  as  also  previously 
alluded  to,  relays  with  high -Q  values  do  not  provide  substantial  (~2X)  speed 
improvement  over  their  low-0  counterparts. 


1.E-14  1.E-13 

Load  Capacitance  CL  [F] 

Fig.  5.6  For  a  given  energy  constraint  and  load  capacitance,  relay  delay  can  be 
minimized  by  properly  choosing  the  relay  design  parameters.  The  lumped 
parameter  model  is  in  close  agreement  with  the  analytical  model. 
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For  ultra-low  power  electronics  applications  such  as  wireless  sensor 
networks,  switching  energy  [5.12]  (rather  than  speed)  is  the  primary  concern,  and 
hence  the  relay’s  minimum  switching  energy  is  the  most  important  metric.  This 
minimum  switching  energy  is  dictated  by  the  need  to  overcome  surface  adhesion 
energy  (T)  in  order  to  break  physical  contact: 

0.5  keffgd2>r  (5.14) 


This  gives  the  minimum  pull-in  and  supply  voltages: 


^ dd,min  ^ pi, min 


16Tg3 

\27E0Ag2d 


(5.15) 


If  the  load  capacitance  is  ignored,  the  relay  switching  energy  is  expressed  by: 


(— ) 

\g-gj 


Vi 


( 


16Pv  1 

27  )  (i -gd/g){ 


(5.16) 


JS  yg-gj  dd’min  V  27  )  (i-gd/g)(gd/g)2 
which  has  a  minimum  value  of  4T  at  gjg  =  2/3.  Of  course  the  switching  energy 
will  be  higher  for  any  practical  relay  designs  because  a  relay  operating  at  exactly 
Fdd=Fpi  has  no  noise  margin  and  is  highly  susceptible  to  process  or  environmental 
variations. 


5.3  Relay  Scaling 


The  need  for  large  supply  voltage  and  layout  area  remains  an  issue  for  the 
relays  demonstrated  in  this  work,  but  can  be  alleviated  if  the  relay  dimensions  are 
properly  scaled  down.  In  a  manner  very  much  analogous  to  the  classic  scaling 
theory  developed  for  MOSFETS  [5.13],  extensive  treatments  of  constant-field 
scaling  for  micro-electromechanical-systems  (MEMS)  have  been  reported  [5.14], 
This  scaling  methodology  maintains  the  electric  field  across  the  actuation  gap  at  a 
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constant  value  while  all  of  the  dimensions  of  the  device  are  scaled  by  a  factor  S. 
Although  this  simple  scaling  methodology  provides  useful  relay  scaling  insights, 
the  scaled  relay  may  not  correspond  to  the  optimal  design  for  a  given  operating 
situation.  To  remedy  this,  we  expand  upon  prior  work  by  developing  an  optimized 
relay  scaling  methodology  and  assess  its  implications  for  relay  switching  speed, 
energy,  and  layout  area. 


Dimple  Area  [pm2] 

Fig.  5.7  Extracted  average  Fa  (with  standard  deviation  indicated)  vs.  A d.  Each  data 
point  is  obtained  by  measuring  more  than  10  relays  with  different  L  values. 

As  previously  alluded  to,  relay  energy  efficiency  is  determined  by  the 

surface  adhesion  energy.  Furthermore,  the  normalized  sensitivity  is  roughly  -(0.7- 

l.l)xp  for  any  design  variable.  Therefore,  by  focusing  on  the  impact  of  scaling  on 

the  minimum  energy  point,  one  can  automatically  find  out  how  relay  energy- 

performance  in  general  changes  with  technology.  As  depicted  in  Fig.  5.7,  the 
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extracted  surface  adhesion  energy,  which  consists  of  Van  der  Waals  forces, 
capillary  forces,  and  hydrogen  bonds  [5.15],  reduces  with  dimple  contact  areas. 
This  means  that  relay  designs  with  lower  beam  stiffness,  smaller  dimple  gap 
thickness  and  therefore  lower  actuation  area  and  supply  voltage  are  feasible  if  a 
smaller  contact  dimple  area  is  utilized.  With  this  in  mind,  suppose  that  the  contact 
dimple  area  is  reduced  by  a  factor  S 2  and  that  W,  h,  gd  and  CL  are  reduced  by  a 
factor  S.  To  maintain  the  same  optimal  gd/g  ratio  of  0.6-0. 8,  the  fabricated  gap 
thickness  is  reduced  by  S.  As  a  consequence,  the  actuation  area  is  reduced  by  S2  to 
reach  the  same  optimal  fan-out.  Since  the  total  capacitance  is  reduced  by  S  and  the 
switching  energy  improves  by  S  ,  the  power  supply  voltage  can  be  scaled  down  by 
5>.  And  finally,  to  maintain  the  same  optimal  gate  overdrive,  Vv\  is  also  reduced  by 
S0'5;  to  achieve  this  goal,  the  beam  length  is  reduced  by  S4  3.  As  a  consequence,  the 
switching  speed  is  improved  by  (S3  +S,10/3)0'5. 

Ultimately,  for  aggressively  scaled  contacts  (~50x50nm  ),  T  is  set  by 
metal-to-metal  bonding  at  the  contact  asperities  [5.15,  5.16],  with  the  associated 
energy  typically  in  the  0.2aJ/bond  range  [5.16,  5.17].  To  achieve  a  contact 
resistance  less  than  lOkQ,  the  radius  of  the  contact  asperities  is  only  a  few 
nanometers.  Using  the  calibrated  analytical  relay  model  with  scaled  device 
dimensions  and  predictive  model  parameters  listed  in  Table  II,  the  energy 
performance  (Fig.  15)  of  a  relay  in  a  65nm  equivalent  technology  is  simulated  and 
compared  against  that  of  MOSFETs.  For  A d=  5x5nm”  and  the  extracted  area- 
dependent  portion  of  f  (4pJ/nr),  the  minimum  energy  of  the  relay  would  be  set  by 
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the  number  of  bonds.  For  example,  with  five  contact  bonds,  Em[n  =  4aJ  (>10x  lower 
than  CMOS)  would  be  achievable.  Since  relays  have  low  gate  capacitance  due  to 
the  air-gap,  relay  performance  is  very  sensitive  to  load/  wire  capacitance,  as 
depicted  in  Fig.  5.8. 


Performance  [GHz] 

Fig.  5.8  Simulated  energy-performance  comparison  of  the  MOSFET  vs.  relay,  for  a 
30-stage  F04  inverter  chain  /  relay  chain  (average  transition  probability  =0.01)  [8], 
MOSFET  parameters  are  taken  from  the  ITRS,  for  the  65nm  LSTP  technology 
node.  Relay  parameters  are  tabulated  in  Table  II.  The  minimum  energy  is  set  by  T. 
Notice  that  due  to  the  low  air-gap  capacitance,  relay  performance  is  more  sensitive 
to  load  capacitance  than  the  MOSFET  [8]. 
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Parameter 

Value 

Parameter 

Value 

Young  Modulus,  E 

145GPa 

Electrode  Length,  LE 

15pm 

Shear  Modulus,  G 

57GPa 

Truss  Width,  WT 

5pm 

Density 

4126kgnr3 

Truss  Length,  LT 

12pm 

Beam  Width,  W 

5pm 

Beam  Thickness,  h 

1pm 

Beam  Length,  L 

{10,...,50}xpm 

Fabricated  Gap  Thickness,  g 

200nm 

Actuation  Plate  Width,  WA 

30|_im 

Dimple  Gap  thickness,  gd 

lOOnm 

Actuation  Plate  Length,  La 

27|_im 

Dimple  Area,  AD 

2x{4, 1 0, 1 5,25}xpm2 

Table  I.  Relay  device  parameters 


Parameter 

Value 

Beam  Width,  W 

65nm 

Beam  Thickness,  h 

15nm 

Fabricated  Dimple  Gap 
Thickness,  gd 

lOnm 

50x50nm2 

Truss  Width,  WT 

65nm 

Truss  Length,  LT 

156nm 

(  Yf ,  Yt ) 

(2.15  ,  5.13x1013  m-2  ) 

(  a0  ,  a-i  ) 

(1.11  ,  0.5  ) 

Table  II.  Relay  device  parameters  for  65nm-equivalent  technology 

5.4  Conclusion 

Using  the  calibrated  relay  delay  and  energy  models,  a  sensitivity-based 
analysis  was  developed  in  this  Chapter  for  relay  design  optimization,  establishing 
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simple  rules  for  relay  design  (as  shown  in  Fig.  5.9).  It  should  be  noted  that  the 
relatively  high  supply  voltages  used  in  this  work  do  not  represent  the  ultimate 
voltage  scaling  limit  for  relay  technology,  especially  given  that  the  measured 
surface  adhesion  force  reduces  with  smaller  contact  dimple  area.  By  properly 
scaling  down  the  relay  dimensions,  analytical  theory  and  modeling  results  indicate 
that  scaled  relays  can  offer  substantial  reduction  in  supply  voltage,  switching  delay 
and  energy.  Relays  may  therefore  provide  for  dramatic  improvements  in  energy 
efficiency  for  applications  requiring  perfonnance  up  to  —100MHz. 


Fig.  5.9  Design  rules  for  optimal  relay 
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Chapter  6 


Conclusion 

6.1  Summary 

Increasing  power  density  is  a  daunting  challenge  for  continued  MOSFET 
scaling  due  to  the  non-scalability  of  the  thermal  voltage  k^TIq.  To  circumvent  this 
CMOS  power  crisis  and  to  allow  for  aggressive  Vdd  reduction,  many  alternative 
switching  device  designs  have  been  proposed  and  demonstrated  to  achieve  steeper 
than  60mV/dec  subthreshold  swing  ( S ).  This  dissertation  began  with  a  general 
overview  of  the  physics  and  operation  of  these  MOSFET-replacement  devices.  It 
then  applied  circuit-level  metrics  to  establish  evaluation  guidelines  for  assessing 
the  promise  of  these  alternative  transistor  designs.  This  work  shows  that  for  a  given 
performance  target  and  logic  style,  there  exists  an  optimal  I0JI0ff  ratio  to  minimize 
the  total  energy,  and  this  ratio  is  roughly  constant  for  most  MOSFET-like  devices. 
This  implies  that  the  device  effective  subthreshold  swing  (5eff)  value,  rather  than 
the  steepest  local  subthreshold  swing  ( S)  value,  detennines  whether  these  designs 
are  more  energy-efficient  than  MOSFETs.  As  an  example,  we  used  this 
methodology  to  compare  TFETs  against  MOSFETs,  showing  that  TFETs  may  offer 
substantial  (~5x)  energy  savings  for  performance  up  to  the  100MHz  range. 
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This  dissertation  then  investigated  the  abrupt  “pull-in”  effect  of 
electrostatically  actuated  MEMS  to  achieve  perfectly  abrupt  (S~0mV/dec)  tum-on 
switching  behavior  in  NEMFET.  To  facilitate  low  voltage  NEMFET  design,  the 
Euler-Bernoulli  beam  equation  is  solved  simultaneously  with  the  Poisson  equation 
in  order  to  accurately  model  the  switching  behavior  of  NEMFETs.  Using  this 
approach,  the  shape  of  the  movable  gate  electrode  and  semiconductor  potential 
across  the  width  of  the  channel  are  derived  for  the  various  regimes  of  transistor 
operation.  The  impact  of  various  transistor  design  parameters  on  the  gate  pull-in 
voltage  and  gate  release  voltage  are  examined.  A  unified  pull-in/release  voltage 
model  is  developed,  to  facilitate  NEMFET  design  for  both  digital  and  analog  circuit 
applications.  Simulation  results  show  that  by  utilizing  a  lOnm  thick  air-gap, 
NEMFET  operation  with  2V  supply  voltage  is  possible. 

Although  NEMFET  design  with  perfectly  abrupt  turn-on  transitions  are 
achievable,  the  large  equivalent  oxide  thickness  in  the  off-state  due  to  the  presence 
of  the  air-gap  makes  NEMFETs  susceptible  to  short  channel  effects,  and  therefore 
limits  their  scalability.  To  alleviate  this  issue,  this  dissertation  then  proposed  using 
micro-relays  for  logic  applications  because  of  their  ideal  switching  behavior:  zero- 
off  state  leakage  and  perfectly  abrupt  turn-on  transition.  To  mitigate  the  contact 
reliability  issue,  this  dissertation  demonstrated  a  contact  design  methodology  for 
reliable  logic  applications.  Since  relatively  high  f?0N  can  be  tolerated  while 
extremely  high  endurance  is  a  necessity,  hard  contacting  electrode  materials  and 
operation  with  low  contact  force  are  preferred.  Using  this  contact  design  technique, 
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this  work  then  developed  a  reliable  logic  relay  technology  that  employs  titanium 
dioxide  (Ti02)  coated  tungsten  (W)  electrodes.  Prototype  relays  fabricated  using  a 
CMOS-compatible,  poly-Sio.4Geo.6  surface  micromachining  process  were 
demonstrated  to  operate  with  low  surface  adhesion  force,  adequately  low  on-state 
resistance  (f?oN  <  lOOkG)  over  a  wide  temperature  range  (20°C-200°C),  and  >109 
on/off  switching  cycles  in  N2  ambient  without  stiction-  or  welding-induced  failure. 
This  paves  a  pathway  to  realizing  reliable  (endurance  >  10 14  on-off  cycles)  micro¬ 
relays  for  digital  logic  applications. 

Using  the  measured  relay  characteristics,  this  dissertation  then  developed 
and  calibrated  relevant  models  and  analytical  fonnulations  for  relay  performance 
(e.g  pull-in/release  voltages,  switching  speed  and  energy)  to  facilitate  relay  design 
optimization.  A  sensitivity-based  energy-delay  optimization  is  developed,  which  is 
then  used  to  establish  simple  relay  design  guidelines.  Based  upon  these  models,  a 
general  scaling  theory  for  electro-mechanical  switches  is  proposed.  Much  like 
CMOS  transistor  scaling,  switch  miniaturization  leads  to  drastic  improvements  in 
density  (for  lower  cost  per  function),  switching  delay  (for  higher  perfonnance),  and 
power  consumption.  A  scaled  relay  technology  is  projected  to  provide  >10x  energy 
savings  for  circuits  operating  at  up  to  —100MHz. 
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6.2  Recommendations  for  Future  Work 


6.2.1  Tunnel  Field  Effect  Transistor 

Among  all  the  candidate  MOSFET-replacement  devices,  the  TFET  appears 
to  be  the  most  likely  to  be  adopted  due  to  its  simple  transistor  structure  and  its 
resemblance  to  the  conventional  MOSFET.  The  principal  challenge  that  TFET 
designers  face  is  to  achieve  high  on-state  current  while  maintaining  a  low  effective 
subthreshold  slope.  To  reach  this  goal,  the  physics  of  band-to-band  tunneling  for 
various  semiconductor  materials  should  be  understood.  The  simple  model  used  in 
this  thesis  is  sufficient  to  provide  initial  investigation  of  TFET  performance,  but  in- 
depth  quantum-mechanical  studies  are  needed  to  fully  understanding  the  tradeoff 
between  on-state  current  and  effective  subthreshold  slope. 

6.2.2  Electromechanical  Devices 

a.  Nano-Electro-Mechanical  Field  Effect  Transistor 

Since  the  NEMFET  has  limited  scalability  due  to  the  presence  of  the  air- 
gap,  it  is  highly  unlikely  to  be  adopted  for  digital  logic  applications.  However,  the 
built-in  transconductance  gain  eliminates  any  parasitic  loss  and  therefore  makes  it 
attractive  for  sensor  and  resonator  applications.  It  is  also  an  attractive  interface 
device  that  converts  tiny  motions  in  nano-electro-mechanical  systems  into  electrical 
signals  and  transmit  them  to  the  outside  world. 

b.  Micro-Relay 
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This  work  constitutes  an  initial  investigation  to  provide  a  pathway  to 
realizing  reliable  micro-relays  for  digital  logic  applications.  The  contact  reliability 
of  metal-to-metal  contacts  at  the  asperities  needs  to  be  more  fully  understood 
before  its  potential  for  application  in  low-power  electronics  can  be  realized.  To 
achieve  this  goal,  a  technique  for  real-time  contact  characterization  to  monitor  how 
the  contact  asperities  change  with  the  number  of  switching  cycles  is  needed.  Such  a 
technique  would  allow  for  systematic  relay  reliability  studies  at  different  operating 
conditions,  such  as  pressure,  temperature,  etc.  If  relays  can  be  fabricated  with  low 
surface  forces  and  operated  reliably  over  trillions  of  cycles,  relay  technology  could 
potentially  provide  dramatic  improvements  in  energy  efficiency  over  a  wide  range 
of  performance. 

Finally,  with  the  fact  that  early  digital  computers  such  as  ENIAC  are  relay- 
based,  it  is  intriguing  to  see  scaled  relay  technology  can  potentially  lead  integrated 
circuits  “back  to  the  future”  and  revolutionize  computation  technology4. 


4  This  footnote  is  for  future  historians.  On  1 1/08/2005,  in  an  electronic  mail,  I  asked  my  advisor 
Professor  Tsu-Jae  King  Liu  “why  don't  we  just  scale  a  RF  MEMS  switch  down  (to  a  RF  NEMS 
switch)  and  use  it  as  a  logic  device  (thus  no  semiconductor  is  needed)  ?  ”  This  was  how  relays 
redux... 
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